Summary
| Distribution | pmf | Mean | Variance | mgf |
|---|---|---|---|---|
| Discrete Uniform | ||||
| Bernoulli | ||||
| Binomial | ||||
| Poisson | ||||
| Negative Binomial | ||||
| Geometric | ||||
| Hypergeometric | — |
Discrete uniform distribution
Finite number of outcomes are equally likely
- pmf: ,
- mean: (when )
- var:
- mgf:
Bernoulli distribution
Definition
Models random experiment whose outcomes are “success” or “failure”.
Link to original
Binomial distribution
Definition
Models the number of successes in a fixed number of independent Bernoulli trials, each with the same probability of success
is the amount of success outcomes
Link to original
Trinomial distribution
Definition
Like binomial, but with 3 possible outcomes. Special case of multinomial with .
- pmf: , where
- mean:
- var:
- mgf:
Conditional distribution:
Correlation coefficient:
Link to original
Multinomial distribution
Definition
Extension of binomial distribution to categories.
is the number of success in the -th variable.
Link to original
Negative binomial distribution
Definition
Distribution of number of failures () needed to get the -th success
Link to original
- pmf: ,
- mean:
- var:
- mgf: ,
Poisson distribution
Definition
Models number of events occurring in fixed intervals of time/space when:
Notes
When modeling count data, use negative binomial distribution instead of poisson distribution to handle overdispersion.
Count data refers to numerical data that represents the number of times an event occurs.
The Poisson distribution is suitable for count data when the mean and variance are approximately equal, assuming events occur independently and at a constant average rate.
However, in real-world count data, overdispersion often occurs, where the variance exceeds the mean due to factors like clustering or unobserved heterogeneity.
The negative binomial distribution accommodates this overdispersion by introducing an additional parameter to model the variance more flexibly.
Link to original
Geometric distribution
Special case of negative binomial distribution with . Distribution for number of failures needed for a single success.
Link to original
- pmf: ,
- mean:
- var:
- mgf: ,
Hypergeometric distribution
Distribution for number of successes in draws from finite population of size containing exactly successes (sampling without replacement)
- pmf: ,
- mean:
- var: