Definition
The Poisson distribution models the number of events occurring in a fixed interval of time or space when events occur independently and at a constant average rate.
Notes
When modeling count data that show overdispersion, use the negative binomial distribution instead of the Poisson distribution.
Count data refers to numerical data that represents the number of times an event occurs.
The Poisson distribution is suitable for count data when the mean and variance are approximately equal, assuming events occur independently and at a constant average rate.
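The equal mean and variance property can be checked numerically. The sketch below is illustrative only: the rate `lam = 3.0` and the truncation of the support at 100 are assumptions chosen for the example, not values from these notes.

```python
import math

def poisson_pmf(k, lam):
    """Probability of observing exactly k events when the average rate is lam."""
    return math.exp(-lam) * lam**k / math.factorial(k)

lam = 3.0             # hypothetical average rate, chosen for illustration
support = range(100)  # truncated support; the tail beyond 100 is negligible for lam = 3

# Mean and variance computed directly from the probability mass function
pmf_mean = sum(k * poisson_pmf(k, lam) for k in support)
pmf_var = sum((k - pmf_mean) ** 2 * poisson_pmf(k, lam) for k in support)

print(pmf_mean, pmf_var)  # both should be very close to lam
```

Both quantities come out equal to the rate up to floating-point error, which is the property that real count data often violate.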
However, in real-world count data, overdispersion often occurs, where the variance exceeds the mean due to factors like clustering or unobserved heterogeneity.
The negative binomial distribution accommodates overdispersion through an additional dispersion parameter, which lets the variance exceed the mean.
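A quick way to screen for overdispersion is to compare the sample variance with the sample mean. The sketch below assumes the common parameterization Var = mu + alpha * mu^2 for the negative binomial, where alpha is the dispersion parameter (alpha = 0 recovers the Poisson case). The function names and the `counts` sample are hypothetical, made up for illustration, and the method-of-moments estimate is only a rough starting point, not a fitted model.

```python
from statistics import mean, variance

def dispersion_index(counts):
    """Sample variance divided by sample mean; about 1 for Poisson-like data."""
    m = mean(counts)
    if m == 0:
        raise ValueError("mean of counts is zero; dispersion index is undefined")
    return variance(counts) / m

def nb_moment_estimates(counts):
    """Method-of-moments estimates (mu, alpha), assuming Var = mu + alpha * mu**2.

    alpha is clipped to 0.0 when the sample shows no overdispersion.
    """
    m = mean(counts)
    v = variance(counts)
    alpha = max((v - m) / m**2, 0.0)
    return m, alpha

# Hypothetical counts, invented for this example
counts = [0, 1, 0, 7, 2, 0, 12, 1, 0, 3]

print(dispersion_index(counts))
print(nb_moment_estimates(counts))
```

For this sample the variance is about six times the mean, so a Poisson model would understate the spread, and the positive alpha estimate points toward the negative binomial.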