About MANOVA
Multivariate Analysis of Variance (MANOVA) extends ANOVA to analyze multiple dependent variables simultaneously. It tests whether group means differ across several dependent variables, accounting for their intercorrelations.
MANOVA is used for multivariate data, where observations include multiple continuous dependent variables and one or more categorical independent variables (factors).
Examples include test scores across subjects (math, science) grouped by teaching method or physiological measures (heart rate, blood pressure) by treatment.
One-way MANOVA involves one independent variable (e.g., teaching method affecting math and science scores).
Two-way MANOVA includes two independent variables (e.g., teaching method and student gender), allowing analysis of main effects and interactions.
Performing MANOVA
- Define hypotheses:
- (Null hypothesis): No group differences across dependent variables.
- (Alternative hypothesis): Differences exist.
- Check assumptions: See Assumptions for MANOVA
- Prepare data: Organize data with categorical independent variables and continuous dependent variables.
- Compute test statistics: Use statistical software (e.g., Python’s
statsmodels, R’smanova) to calculate test statistics. See MANOVA formulas for manual computation, and Differences between MANOVA test statistics to pick most suitable test statistic for your data. - Evaluate significance: Compare p-values to significance level. Optionally, use MANOVA rejection rules.
Optional:
- Post-hoc analysis: If significant, conduct univariate ANOVAs or discriminant analysis to identify specific group differences.
- Interpret results: Report test statistics, p-values, and effect sizes (e.g., partial eta-squared). Discuss practical implications.
Assumptions for MANOVA
- Dependent variables are continuous and multivariate normally distributed.
- Homogeneity of variance-covariance matrices across groups (use Box’s M test).
- Linear relationships among dependent variables.
- No extreme outliers.
- Adequate sample size (more observations than dependent variables per group).
Formulas used for performing MANOVA
- Wilks’ Lambda:
- Pillai’s Trace:
- Lawley-Hotelling Trace:
- Roy’s Largest Root:
Base formulas
- : Number of dependent variables
- : Total number of observations across all groups
- : Number of groups
- is the vector of observations for the -th subject in the -th group
- is the mean vector for group
- is the overall mean vector
Derived formulas:
- : Error sum of squares
- : Hypothesis sum of squares
- : Eigenvalues of , where is the largest
- : Hypothesis degrees of freedom
- : Error degrees of freedom
F-transformation (when , ):
- , where , follows
MANOVA rejection rules
-
Wilks’ Lambda:
-
Pillai’s Trace:
Where:
-
Lawley-Hotelling Trace:
-
Roy’s Largest Root:
-
F-transformation (when , ):
Differences between MANOVA test statistics
-
Wilks’ Lambda ():
Measures ratio of error to total variance-covariance. Sensitive to group differences, widely used, but less robust to assumption violations (e.g., non-normality, unequal covariances). -
Pillai’s Trace ():
Sums eigenvalues of . Most robust to violations of assumptions (e.g., heterogeneity of variances), preferred for unequal sample sizes or non-normal data. -
Lawley-Hotelling Trace ():
Sums eigenvalues of . Powerful for detecting group differences, but sensitive to assumption violations, particularly non-normality. -
Roy’s Largest Root ():
Uses largest eigenvalue. Focuses on maximum group separation, powerful when differences are concentrated in one dimension, but less robust and not ideal for multiple group differences.