Summary

DistributionpmfMeanVariancemgf
Discrete Uniform
Bernoulli
Binomial
Poisson
Negative Binomial
Geometric
Hypergeometric

Discrete uniform distribution

Finite number of outcomes are equally likely

  • pmf: ,
  • mean: (when )
  • var:
  • mgf:

Bernoulli distribution

Definition

Models random experiment whose outcomes are “success” or “failure”.

Link to original

Binomial distribution

Definition

Models the number of successes in a fixed number of independent Bernoulli trials, each with the same probability of success

is the amount of success outcomes

  • pmf: ,
  • Mean:
  • Variance:
  • mgf:
Link to original

Trinomial distribution

Definition

Like binomial, but with 3 possible outcomes. Special case of multinomial with .

  • pmf: , where
  • mean:
  • var:
  • mgf:

Conditional distribution:

Correlation coefficient:

Link to original

Multinomial distribution

Definition

Extension of binomial distribution to categories.

is the number of success in the -th variable.

  • pmf:
  • Mean:
  • Variance:
  • Covariance:
Link to original

Negative binomial distribution

Definition

Distribution of number of failures () needed to get the -th success

  • pmf: ,
  • mean:
  • var:
  • mgf: ,
Link to original

Poisson distribution

Definition

Models number of events occurring in fixed intervals of time/space when:

  • Events occur at constant rate
  • Events are independent of one another
  • pmf: , ,
  • mean:
  • variance:
  • mgf:

Notes

When modeling count data, use negative binomial distribution instead of poisson distribution to handle overdispersion.

Count data refers to numerical data that represents the number of times an event occurs.

The Poisson distribution is suitable for count data when the mean and variance are approximately equal, assuming events occur independently and at a constant average rate.

However, in real-world count data, overdispersion often occurs, where the variance exceeds the mean due to factors like clustering or unobserved heterogeneity.

The negative binomial distribution accommodates this overdispersion by introducing an additional parameter to model the variance more flexibly.

Link to original

Geometric distribution

Special case of negative binomial distribution with . Distribution for number of failures needed for a single success.

  • pmf: ,
  • mean:
  • var:
  • mgf: ,
Link to original

Hypergeometric distribution

Distribution for number of successes in draws from finite population of size containing exactly successes (sampling without replacement)

  • pmf: ,
  • mean:
  • var: