Representation of Competencies in Multidimensional IRT Models with Within-Item and Between-Item Multidimensionality
Abstract
Multidimensional item response theory (MIRT) holds considerable promise for the development of psychometric models of competence. It provides an ideal foundation for modeling performance in complex domains, simultaneously taking into account multiple basic abilities. The aim of this paper is to illustrate the relations between a two-dimensional IRT model with between-item multidimensionality and a nested-factor model with within-item multidimensionality, and the different substantive meanings of the ability dimensions in the two models. Both models are applied to empirical data from a large-scale assessment of reading and listening comprehension in a foreign language. In the between-item model, performance in the reading and listening items is modeled by two separate dimensions. In the within-item model, one dimension represents the abilities common to both tests, and a second dimension represents abilities specific to listening comprehension. Distinct relations of external variables, such as gender and cognitive abilities, with ability scores demonstrate that the alternative models have substantively different implications.
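The difference between the two models can be pictured as two item-by-dimension loading patterns. The following is a minimal sketch, not the specification used in the study: the item counts (three reading, three listening) and the function names are hypothetical and chosen only for illustration. A 1 marks a dimension an item is assumed to load on.

```python
# Hypothetical layout: 3 reading items followed by 3 listening items.
# These counts are illustrative assumptions, not the study's test design.
N_READING = 3
N_LISTENING = 3


def between_item_pattern(n_reading, n_listening):
    """Columns are (reading, listening); every item loads on exactly one dimension."""
    return [[1, 0] for _ in range(n_reading)] + [[0, 1] for _ in range(n_listening)]


def within_item_pattern(n_reading, n_listening):
    """Columns are (common, listening-specific).

    All items load on the common dimension; listening items
    additionally load on the listening-specific dimension.
    """
    return [[1, 0] for _ in range(n_reading)] + [[1, 1] for _ in range(n_listening)]


between = between_item_pattern(N_READING, N_LISTENING)
within = within_item_pattern(N_READING, N_LISTENING)

for name, pattern in (("between-item", between), ("within-item", within)):
    print(name)
    for row in pattern:
        print("  ", row)
```

In the between-item pattern each row contains a single 1, so the two dimensions are defined by disjoint item sets. In the within-item pattern the listening rows contain two 1s, which is why the second dimension there can only be read as what is specific to listening beyond the common dimension.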