Probability Theory

Topics

Notes

Linked

Maximum Mean Discrepancy (MMD)

Need a computationally tractable distance measure between empirical distributions without requiring density estimation.

Maximum Entropy Principle

Many probability distributions are consistent with known constraints. Choose the one with maximum entropy as the least biased estimate.

Importance Sampling

How do you estimate expectations under a target distribution when you only have samples from a different distribution?

Monte-Carlo Estimation

Analytical computation of expectations is intractable. Approximate expectations by averaging random samples.

Variational Inference

Exact posterior inference is intractable for complex models. Approximate the posterior with a simpler distribution by minimizing KL divergence.

Jensen–Shannon Divergence

KL divergence goes to infinity when there is no support overlap resulting in no gradient

Jensen's Inequality

Proving convexity/concavity properties and establishing bounds like such KL non-negativity and convexity of loss functions.