Abstract
Abstract. We addressed potential test takers’ preferences for women or men as examiners as well as how examiners were perceived depending on their gender. We employed an online design with 375 students who provided preferences for and ratings of examiners based on short video clips. The clips showed four out of 15 psychologists who differed in age (young vs. middle-aged) and gender giving an introduction to a fictional intelligence test session. Employing multivariate multilevel analyses we found female examiners to be perceived as more social competent and middle-aged examiners being perceived as more competent. Data analyses revealed a significant preference for choosing women as examiners. Results were discussed with reference to test performance and fairness.
References
1969). The influence of examiner race on first-grade and Kindergarten subjects’ Peabody Picture Vocabulary Test Scores. Journal of Educational Measurement, 6, 241–246. doi: 10.1111/j.1745-3984.1969.tb00683.x
(1954). The structuring of events – outline of a general theory with applications to psychology. Psychological Review, 61, 281–303. doi: 10.1037/h0062678
(1999). Standards for Educational and Psychological Testing. Washington, DC: American Educational Research Association.
. (2010). Bayesian analysis of latent variable models using Mplus, Unpublished manuscript. www.statmodel.com/download/BayesAdvantages18.pdf
(2006). A comparison of Bayesian and likelihood-based methods for fitting multilevel models. Bayesian Analysis, 1, 473–514. doi: 10.1214/06-BA117
(2000). Stereotypes as dynamic constructs: Women and men of the past, present, and future. Personality and Social Psychology Bulletin, 26, 1171–1188. doi: 10.1177/0146167200262001
(2002). Paternalistic and envious gender stereotypes: Testing predictions from the stereotype content model. Sex Roles, 47, 99–114. doi: 10.1023/A:1021020920715
(European Commission (2011). Report on progress on equality between women and men in 2010 – The gender balance in business leadership. Luxembourg: Publications Office of the European Union, 2011. doi: 10.2767/99441
1999). Psychological assessment: Future challenges and progresses. European Psychologist, 4, 248–262. doi: 10.1027/1016-9040.4.4.248
(2007). Universal dimensions of social cognition: Warmth and competence. Trends in Cognitive Sciences, 11, 77–83. doi: 10.1016/j.tics.2006.11.005
(1990). A continuum of impression-formation, from category-based to individuating processes – influences of information and motivation on attention and interpretation. Advances in Experimental Social Psychology, 23, 1–74.
(1996). Social cognition (2nd ed.). New York, NY: McGraw Hill.
(2004). Bayesian data analysis (2nd ed.). Boca Raton, FL: Chapman & Hall.
(2005). Evaluations of sexy women in low- and high-status jobs. Psychology of Women Quarterly, 29, 389–395. doi: 10.1111/j.1471-6402.2005.00238.x
(1982). Race of examiner effects and the validity of intelligence-tests. Review of Educational Research, 52, 469–497. doi: 10.2307/1170263
(1985). Mediation of interpersonal expectancy effects – 31 meta-analyses. Psychological Bulletin, 97, 363–386. doi: 10.1037/0033-2909.97.3.363
(2006). Fairness is not validity or cultural bias in racial-group assessment: A quantitative perspective. American Psychologist, 61, 845–859.
(1996). Stereotypes. Annual Review of Psychology, 47, 237–271. doi: 10.1146/annurev.psych.47.1.237
(2010). Multilevel analysis: Techniques and applications (2nd ed.). New York, NY: Routledge.
(2001). The accuracy of multilevel structural equation modeling with pseudobalanced groups and small samples. Structural Equation Modeling, 8, 157–174.
(2012). How few countries will do? Comparative survey analysis from a Bayesian perspective. Survey Research Methods, 6, 87–93.
(2009). Race of the interviewer and the black-white test score gap. Social Science Research, 38, 31–40. doi: 10.1016/j.ssresearch.2008.07.004
(2009). Interacting with women can impair men’s cognitive functioning. Journal of Experimental Social Psychology, 45, 1041–1044. doi: 10.1016/j.jesp.2009.05.004
(2005). Attitudes toward younger and older adults: An updated meta-analytic review. Journal of Social Issues, 61, 241–266. doi: 10.1111/j.1540-4560.2005.00404.x
(2009). Stereotyping based on voice in the presence of individuating information: Vocal femininity affects perceived competence but not warmth. Personality and Social Psychology Bulletin, 35, 198–211. doi: 10.1177/0146167208326477
(2004). Psychological research online. Report of Board of Scientific Affairs’ Advisory Group on the conduct of research on the Internet. American Psychologist, 59, 105–117. doi: 10.1037/0003-066X.59.2.105
(1996). Forming impressions from stereotypes, traits, and behaviors: A parallel-constraint-satisfaction theory. Psychological Review, 103, 284–308. doi: 10.1037/0033-295X.103.2.284
(2007). Are there test administrator effects in Large-Scale educational assessments? Using cross-classified multilevel analysis to probe for effects on mathematics achievement and sample attrition. Methodology, 3, 149–159. doi: 10.1027/1614-2241.3.4.149
(2004a). Robustness issues in multilevel regression analysis. Statistica Neerlandica, 58, 127–137.
(2004b). The influence of violations of assumptions on multilevel parameter estimates and their standard errors. Computational Statistics & Data Analysis, 46, 427–440.
(1980). Influence of examiners ethnic attributes on Intelligence-Test Scores. Psychology in the Schools, 17, 117–122. doi: 10.1002/1520-6807(198001)17:1<117::AID-PITS2310170122>3.0.CO;2-6
(2007). Improving international tests and testing. European Psychologist, 12, 206–219. doi: 10.1027/1016-9040.12.3.206
(2012). Bayesian structural equation modeling: A more flexible representation of substantive theory. Psychological methods, 17, 313–335.
(1998-2012). Mplus user’s guide (7th ed.). Los Angeles, CA: Muthén & Muthén.
(2011). Test administrator’s gender affects female and male students’ self-estimated verbal general knowledge. Learning and Instruction, 21, 14–21. doi: 10.1016/j.learninstruc.2009.09.003
(2009). Age stereotypes in the workplace: Common stereotypes, moderators, and future research directions. Journal of Management, 35, 158–188. doi: 10.1177/0149206308318617
(2002). Ageism in teaching: stereotypical beliefs and discriminatory attitudes towards the over-50s. Work Employment and Society, 16, 355–371. doi: 10.1177/095001702400426884
(1976). Experimenter effects in behavioral research. New York, NY: Irvington.
(2000). Motivated stereotyping of women: She’s fine if she praised me but incompetent if she criticized me. Personality and Social Psychology Bulletin, 26, 1329–1342. doi: 10.1177/0146167200263002
(2000). Instrumental and expressive traits, trait stereotypes, and sexist attitudes – What do they signify? Psychology of Women Quarterly, 24, 44–62. doi: 10.1111/j.1471-6402.2000.tb01021.x
(1998). Automatic activation of stereotypes: The role of self-image threat. Personality and Social Psychology Bulletin, 24, 1139–1152. doi: 10.1177/01461672982411001
(1989). Social-comparison activity under Threat – downward evaluation and upward contacts. Psychological Review, 96, 569–575. doi: 10.1037/0033-295X.96.4.569
(2002). Age and assessments of professional expertise: The relationship between higher level employees’ age and self-assessments or supervisor ratings of professional expertise. International Journal of Selection and Assessment, 9, 309–324. doi: 10.1111/1468-2389.00183
(2004). Why boys achieve less at school than girls: The difference between boys’ and girls’ academic culture. Educational Studies, 30, 159–173. doi: 10.1080/0305569032000159804
(2014). In the eye of the examinee: Likable examiners interfere with performance. Social Psychology of Education, 17, 401–417. doi: 10.1007/s11218-014-9252-z
(1981). Downward Comparison Principles in Social-Psychology. Psychological Bulletin, 90, 245–271. doi: 10.1037/0033-2909.90.2.245
(