Abstract
Parametric assumptions for statistical tests include normality and equal variances. Micceri (1989) found that data frequently violate the normality assumption; variances have received less attention. We recorded within-group variances of dependent variables for 455 studies published in leading psychology journals. Sample variances differed, often substantially, suggesting frequent violation of the assumption of equal population variances. Parallel analyses of equal-variance artificial data otherwise matched to the characteristics of the empirical data show that unequal sample variances in the empirical data exceed expectations from normal sampling error and can adversely affect Type I error rates of parametric statistical tests. Variance heterogeneity was unrelated to relative group sizes or total sample size and observed across subdisciplines of psychology in experimental and correlational research. These results underscore the value of examining variances and, when appropriate, using data-analytic methods robust to unequal variances. We provide a standardized index for examining and reporting variance heterogeneity.
References
1960). The effects of violations of assumptions underlying the t test. Psychological Bulletin, 57, 49–64.
(1954). Some theorems on quadratic forms applied in the study of analysis of variance problems: I. Effect of inequality of variance in the one-way model. Annals of Mathematical Statistics, 25, 290–302.
(1981). Variance stabilizing transformations and the power of the F test. Journal of Educational Statistics, 6, 55–74.
(1980). Randomization tests. New York, NY: Marcel Dekker.
(1993). An introduction to bootstrap. London, UK: Chapman and Hall.
(1983). Curvilinear transformations of the dependent variable. Psychological Bulletin, 93, 382–387.
(2000). Heterogeneity of variance in clinical data. Journal of Consulting and Clinical Psychology, 68, 155–165.
(2005). Effect sizes for research: A broad practical approach. Mahwah, NJ: Erlbaum.
(2008). Sex differences in variability in general intelligence: A new look at the old question. Perspectives on Psychological Science, 3, 518–531.
(2008). A generally robust approach for testing hypotheses and setting confidence intervals for effect sizes. Psychological Methods, 13, 110–129.
(1998). Statistical practices of educational researchers: An analysis of their ANOVA, MANOVA, and ANCOVA analyses. Review of Educational Research, 68, 350–386.
(2004). The new and improved two-sample t test. Psychological Science, 15, 47–51.
(2008). A comparative study of robust tests for spread: Asymmetric trimming strategies. British Journal of Mathematical and Statistical Psychology, 61, 235–253.
(1995). Approximate degrees of freedom tests: A unified perspective on testing for mean equality. Psychological Bulletin, 117, 547–560.
(2004). Designing experiments and analyzing data: A model comparison perspective (2nd ed.). Mahwah, NJ: Erlbaum.
(1989). The unicorn, the normal curve, and other improbable creatures. Psychological Bulletin, 105, 156–166.
(1977). Assumption violations and rates of Type I error for the Tukey multiple comparison test: A review and empirical investigation via a coefficient of variance variation. Journal of Experimental Education, 46, 20–25.
(2008). A probability-based measure of effect size: Robustness to base rates and other factors. Psychological Methods, 13, 19–30.
(1986). Comparison of ANOVA alternatives under variance heterogeneity and specific noncentrality structures. Psychological Bulletin, 99, 90–99.
(1987). New designs in analysis of variance. Annual Review of Psychology, 61, 165–170.
(2001). Fundamentals of modern statistical methods: Substantially improving power and accuracy. New York, NY: Springer.
(2003). Applying contemporary statistical techniques. San Diego, CA: Academic Press.
(1986). New Monte Carlo results on the robustness of the ANOVA F, W, and F* statistics. Communications in Statistics: Simulation and Computation, 15, 933–944.
(2003). Modern robust data analysis methods: Measures of central tendency. Psychological Methods, 8, 254–274.
(