Construction and Validation of a Test for Inductive Reasoning1
Abstract
Summary We present in this paper a test for inductive reasoning (TIR), which consists of two versions that can be used to assess the inductive reasoning development of third-grade pupils in primary education. The test versions can also be used in combination with a training program for inductive reasoning. Two experiments using samples of 954 and 145 pupils were carried out to investigate the psychometric properties of the tests, including validity. Item response theory (IRT) analyses revealed that the scores on the two TIR tests gave meaningful inductive reasoning summaries. This was supported by analyses of the convergent and divergent validity of the TIR tests. IRT analyses were used to equate the two TIR test versions such that the scores can be compared on a common scale. Possible explanations for the misfit of items that were deleted from the TIR tests are discussed.
References
References
Andersen, E.B. (1973). A goodness of fit test for the Rasch model.. Psychometrika, 38, 123– 140Bereiter, C., Scardamalia, M. (1979). Pascual-Leone's M construct as a link between cognitive-developmental and psychometric concepts of intelligence.. Intelligence, 3, 41– 63Bidell, T.R., Fischer, K.W. (1992). Beyond the stage debate: Action, structure, and variability in Piagetian theory and research.. In R.J. Sternberg & C.A. Berg (Eds.), Intellectual development (pp. 100-140). Cambridge: Cambridge University Press.Brown, A.L., Campione, J.C., Reeve, R.A., Ferrara, R.A., Palinscar, A.S. (1991). Interactive learning and individual understanding: The case of reading and mathematics.. In L.T. Landsmann (Ed.), Culture, schooling, and psychological development (pp. 136-170). Norwood, NJ: Ablex Publishing Corporation.Carey, S. (1985). Conceptual change in childhood. . Cambridge, MA: MIT Press.Carpenter, P.A., Just, M.A., Shell, P. (1990). What one intelligence test measures: A theoretical account of the processing in the Raven Progressive Matrices Test.. Psychological Review, 97(3), 404– 431Carroll, J.B. (1993). Human cognitive abilities: A survey of factor-analytic studies. . New York: Cambridge University Press.Case, R. (1974). Structures and strictures: Some functional limitations on the course of cognitive growth.. Cognitive Psychology, 6, 544– 5731995). Luistertoets. Handleiding. ‘Listening Comprehension Test. Manual’.. Arnhem: Author.
(Csapó, B. (1999). Improving thinking through the content of teaching.. In J.H.M. Hamers, J.E.H van Luit, & B. Csapó (Eds.), Teaching and learning thinking skills (pp. 37-63). Lisse: Swets & Zeitlinger.De Koning, E. (2000). Inductive reasoning in primary education. Measurement, teaching, transfer. . Zeist: Kerckbosch.De Koning, E., Hamers, J.H.M. (1995). Programma Inductief Redeneren 1 ‘Program Inductive Reasoning 1’.. Utrecht: Utrecht University Press ISOR.De Koning, E., Hamers, J.H.M. (1999). Teaching inductive reasoning: Theoretical background and educational implications.. In J.H.M. Hamers, J.E.H. van Luit, & B. Csapó (Eds.), Teaching and learning thinking skills (pp. 157-188). Lisse: Swets & Zeitlinger.De Koning, E., Hamers, J.H.M., Sijtsma, K., Vermeer, A. (2002). Teaching and transfer of inductive reasoning in primary education.. Developmental Review, 22, 211– 241De Koning, E., Sijtsma, K., Hamers, J.H.M. (2002). Comparison of four IRT models when analyzing two tests for inductive reasoning.. Applied Psychological Measurement, 26, 302– 320Dodwel, P.C. (1960). Children's understanding of number and related concepts.. Canadian Journal of Psychology, 14, 191– 205Engelen, R.J.H., Eggen, T.J.H.M. (1993). Equivaleren ‘Equating’.. In T.J.H.M. Eggen & P.F. Sanders (Eds.), Psychometrie in de praktijk ‘Psychometrics into practice’ (pp. 309-348). Arnhem: CITO Instituut voor Toetsontwikkeling.Evans, T.G. (1968). A program for the solution of a class of geometric analogy intelligence test questions.. In M. Minsky (Ed.), Semantic information processing (pp. 271-353). Cambridge, MA: MIT Press.Feurstein, R., Rand, Y., Jensen, M.R., Kaniel, S., Tzuriel, D. (1987). Prerequisites for assessment of learning potential: The LPAD model.. In C.S. Lidz (Ed.), Dynamic Assessment: An interactional approach to evaluating learning potential (pp. 35- 51). New York: Guilford.Glas, C.A.W., Ouborg, M.J. (1993). Vraagonzuiverheid ‘Differential item functioning’.. In J.H.M. Eggen & P.F. Sanders (Eds.), Psychometrie in de praktijk ‘Psychometrics into practice’ (pp. 349-370). Arnhem: CITO Instituut voor Toetsontwikkeling.Glas, C.A.W., Verhelst, N.D. (1995). Testing the Rasch model.. In G.H. Fischer & I.W. Molenaar (Eds.), Rasch models. Foundations, recent developments, and applications (pp. 69-95). New York: Springer-Verlag.Glas, C.A.W., Ellis, J.L. (1993). User's manual RSP. Rasch Scaling Program. . Groningen, The Netherlands: iecProGAMMA.Goswami, U. (1991). Analogical Reasoning: What develops? A review of research and theory.. Child Development, 62, 1– 22Grigorenko, E.L. Sternberg, R.J. (1998). Dynamic testing.. Psychological Bulletin, 124, 75– 111Hamers, J.H.M., De Koning, E. Ruijssenaars, A.J.J.M. (1997). A diagnostic program as learning potential assessment procedure.. Educational and Child Psychology, 14, 73– 82Hamers, J.H.M., De Koning, E., Sijtsma, K. (1998). Inductive reasoning in the third grade: Intervention promises and constraints.. Contemporary Educational Psychology, 23, 132– 148Holland, J.H., Holyoak, H.J., Nisbett, R.E., Thagard, P.R. (1986). Induction. Processes of inference, learning and discovery. . Cambridge, MA: MIT Press.Hosenfeld, B., Van den Boom, D.C., Resing, W. (1997). New Instrument.. Constructing geometric analogies for the longitudinal testing of elementary school children. Journal of Educational Measurement, 34, 4, 367– 372Hunt, E.B. (1974). Quote the Raven? Nevermore!. In L.W. Gregg (Ed.), Knowledge and cognition (pp. 129-158). Hillsdale, NJ: Erlbaum.Klauer, K.J. (1989). Denktraining für Kinder 1. Ein Program zur intellektuellen Förderung ‘Inductive reasoning. A program for the stimulation of inductive reasoning’.. Göttingen: Hogrefe.Klauer, K. (1990). A process theory of inductive reasoning tested by the teaching of domain-specific thinking strategies.. European Journal of Psychology of Education, 5, 191– 206Klauer, K.J. (1997). Lässt sich die Strategie des induktiven Denkens auf schulisches Lernen transferierbar lehren? ‘Can the strategy to reason inductively be taught such that it transfers to learning of school-type material?’. Zeitschrift für Entwicklungspsychologie und Pädagogische Psychologie, 29, 225– 241Klauer, K.J. (1999). Über den Einfluss des induktiven Denkens auf den Erwerb unanschaulich-generischen Wissens bei Grund- und Sonderschülern. ‘On the impact of inductive reasoning on the acquisition of abstract generic knowledge with elementary school and with learning disabled children.’. Psychologie in Erziehung und Unterricht, 46, 7– 28Marshalek, B., Lohman, D.F., Snow, R.E. (1983). The complexity continuum in the radex and hierarchical models of intelligence.. Intelligence, 7, 107– 127Meijer, R.R., Sijtsma, K., Smid, N.G. (1990). Theoretical and empirical comparison of the Mokken and the Rasch approach to IRT.. Applied Psychological Measurement, 14, 283– 298Mokken, R.J. (1971). A theory and procedure of scale analysis. . The Hague: Mouton/Berlin: De Gruyter.Mokken, R.J. (1997). Nonparametric models for dichotomous responses.. In W.J. van der Linden & R.K. Hambleton (Eds.), Handbook of modern item response theory (pp. 351-367). New York: Springer-Verlag.Molenaar, I.W., Sijtsma, K., (2000). MSP5 for Windows. User's manual. . Groningen, The Netherlands: iecProGAMMA.Mulholland, T.M., Pellegrino, J.W., Glaser, R. (1980). Components of geometric analogy solution.. Cognitive Psychology, 12, 252– 284Nisbett, R.E. (1993). Rules for reasoning. . Hillsdale, NJ: Erlbaum.Nunnally, J.C. (1978). Psychometric theory. . New York: McGraw-Hill.Palinscar, A.S., Brown, A.L. (1988). Teaching and practical thinking skills to promote comprehension in the context of group problem solving.. RASE: Remedial and Special Education, 9, 1, 53– 59Pascual-Leone, L. (1970). A mathematical model for the transition rule in Piaget's developmental stages.. Acta Psychologica, 32, 4, 301– 345Pennings, A.H., Hessels, M.G.P. (1996). The measurement of mental attentional capacity: A Neo-Piagetian developmental study.. Intelligence, 23, 1, 59– 78Piaget, J. (1970). Piaget's theory.. In P.H. Mussen (Ed.), Carmichael's handbook of child development (pp. 703-732). New York: Wiley.Ponocny, I. (2001). Nonparametric goodness-of-fit tests for the Rasch model.. Psychometrika, 66, 437– 460Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. . Copenhagen, Denmark: Nielsen & Lydiche.Raven, J.C. (1958). Standard Progressive Matrices. . London: Lewis.Richardson, K. (1996). Putting Raven into context: A response to Roberts & Stevenson.. British Journal of Educational Psychology, 66, 533– 538Roberts, M.J., Stevenson, N.J. (1996). Reasoning with Raven - with and without help.. British Journal of Educational Psychology, 66, 519– 532Sijtsma, K. (1983). Rasch-homogeniteit empirisch onderzocht ‘Rasch homogeneity empirically examined’.. Tijdschrift voor Onderwijsresearch, 8, 104– 121Sijtsma, K., Molenaar, I.W. (2002). Introduction to nonparametric item response theory. . Thousand Oaks, CA: Sage.Sijtstra, J. (1992). Balans van het taalonderwijs halverwege de basisschool ‘Evaluation of language education half-way primary school’.. Arnhem: CITO Instituut voor Toetsontwikkeling.Snow, R.E., Kyllonen, P.C., Marshalek, B. (1984). The topography of ability and learning correlations.. In R.J. Sternberg (Ed.), Advances in the psychology of human intelligence (pp. 47-103). Hillsdale, NJ: Erlbaum.Spearman, C. (1927). The abilities of man. . New York: Macmillan.Sternberg, R.J. (1998). When will the milk spoil? Everyday induction in human intelligence.. Intelligence, 25, 3, 185– 203Sternberg, R.J., Gardner, M.K. (1983). Unities in inductive reasoning.. Journal of Experimental Psychology: General, 112, 1, 80– 116Stout, W.F. (1990). A new item response theory modeling approach with applications to unidimensionality assessment and ability estimation.. Psychometrika, 55, 293– 325Van den Wollenberg, A.L. (1982). A simple and effective method to test the dimensionality axiom of the Rasch model.. Applied Psychological Measurement, 6, 83– 91Van der Linden, W.J., Hambleton, R.K. (1997). Handbook of modern item response theory. . New York: Springer-Verlag.Veldhuijzen, N.H., Godebeld, P., Sanders, P.F. (1993). Klassieke testtheorie en generalizeerbaarheidstheorie. ‘Classic test theory and generalizability theory’.. In T.J.H.M. Eggen & P.F. Sanders (Eds.), Psychometrie in de praktijk ‘Psychometrics into practice’ (pp. 33-82). Arnhem: CITO Instituut voor Toetsontwikkeling.Verhelst, N.D., Glas, C.A.W. (1995). The one parameter logistic model.. In G.H. Fischer & I.W. Molenaar (Eds.), Rasch models. Foundations, recent developments, and applications (pp. 215- 237). New York: Springer-Verlag.Verhoeven, L. (1996). Woordenschattoets I. Handleiding. ‘Vocabulary Test I. Manual’.. Arnhem: CITO.Vernon, P.E. (1971). The structure of human abilities. . London: Methuen.Vosniadou, S. (1989). Analogical reasoning as a mechanism in knowledge acquisition: A developmental perspective.. In S. Vosniadou & A. Ortony (Eds.), Similarity and analogical reasoning (pp. 413-437). Cambridge: Cambridge University Press.Wijnstra, J. (1987). De samenstelling van de schoolbevolking in het basisonderwijs. ‘The composition of the school population in primary education.’. Arnhem: CITO Instituut voor Toetsontwikkeling.Willmes, K., Heller, K.A., Lengfelder, A. (1997). Testrezension zu Standard Progressive Matrices. ‘A review of the Standard Progressive Matrices (SPM).’. Zeitschrift für Differentielle und Diagnostische Psychologie, 18, 117– 120Zimmerman, C. (2000). The development of scientific reasoning skills.. Developmental Review, 20, 99– 149