Abstract
Psychological testing has been undergoing major changes. One of the main changes is the transition from the use of classical to modern test models and methods in test development. The purposes of this paper are to describe the shortcomings of classical test models which are overcome with modern test theory, i.e., item response theory, to introduce the basic concepts of item response theory, to describe several important international applications of item response theory models, and finally, to describe some likely IRT directions in the next century.
References
1989). Psychological testing (6th ed.). New York: Macmillan.
(1990). The construct validity of a Turkish depressive attribution style questionnaire (DASQ) for university students. Unpublished manuscript.
(1989). Item-banking of EFL items using the 3-p logistic model (CAT Project Report No.4). Jerusalem, Israel: National Institute for Testing and Evaluation.
(1989, August). The use of one parameter logistic model in the university entrance examinations in Turkey. Paper presented at the 11th EAIR Forum, University of Trier, Germany.
(1994). Methods for identifying biased test items. Newbury Park, CA: Sage.
(1989). Computerized adaptive test of English proficiency (CAT Project Report No.6). Jerusalem, Israel: National Institute for Testing and Evaluation.
(1989). A manual for NITEST - a program for estimating IRT parameters (CAT Project Report No.1). Jerusalem, Israel: National Institute for Testing and Evaluation.
(1989). A manual for NITECAT a software package for research on CAT/IRT version 1 (CAT Project Report No.2). Jerusalem, Israel: National Institute for Testing and Evaluation.
(1991). IRT equating methods. Educational Measurement: Issues and Practice, 10(3), 37– 45
(1994, August). Using item response theory to detect bias in a translated locus of control scale. Paper presented at the meeting of the British Psychological Society, Brighton.
(1985). A comparison of five methods for estimating the standard error of measurement at specific score levels. Applied Psychological Measurement, 9(4), 351– 361.
(1992). A Rasch model with a multivariate distribution of ability In , Objective measurement: Theory into practice: Vol. 1. Norwood, NJ: Ablex Publishing Corporation.
(1991) Applications of item response theory in equating. China Examinations, 3, 25– 29.
(1950). Theory of mental tests. New York: Wiley.
(1993). Translating achievement tests for use in cross-national studies. European Journal of Psychological Assessment, 9(1), 57– 68.
(1994a). Item response theory: a broad psychometric framework for measurement advances. Psicothema, 6(3), 535– 556.
(1994b). Guidelines for adapting educational and psychological tests: A progress report. European Journal of Psychological Assessment, 10, 229– 240.
(1991). Fundamentals of item response theory. Newbury Park, CA: Sage.
(1993). Differential item functioning. Hillsdale, NJ: Lawrence Erlbaum.
Eds.(1990). Has item response theory increased the validity of achievement test scores?. Applied Measurement in Education, 3(2), 115– 141.
(1990). Testing the reliability of Raven computerized adaptive tests. Information on Psychological Sciences, 34– 39.
(1952). A theory of mental test scores. Psychometric Mongraph No.7.
(1980). Applications of item response theory to practical testing problems. Hillsdale, NJ: Lawrence Erl-baum.
(1968) Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
(1990). Profiles of learning: The Basic Skills Testing Program in New South Wales, 1989. Hawthorn, Australia: Australian Council for Educational Research.
(1996). Partial credit model. In , Handbook of modern item response theory. New York: Springer-Verlag Publishers.
(1991). Woodcock-Johnson Technical Manual. Allen, TX: DLM.
(1960). Probabilistic models for some intelligence and attainment tests. Copenhagen: Neilsen and Lydiche.
(1994). Empirical test of the stage model of organizational socialization: Development of integrated model and verification of its validity. The Japanese Journal of Administrative Behavior, 9(1).
(1996). ConTEST Technical Manual. Groningen, the Netherlands: ProGamma.
(1996). Translating tests: Some practical guidelines. European Psychologist, 1, 89– 99
(1989). A maximin model for test design with practical constraints. Psychometrika, 54(2), 237– 247.
(1996)Handbook of modern item response theory New York: Springer-Verlag Publishers.
(1991). OTD: Optimal test design (Manual). Arnhem, The Netherlands: CITO.
(1990). Computerized adaptive testing: A primer Hillsdale, NJ: Lawrence Erlbaum.
(1994, July). A development of a job interest index by item response theory: Based on Japanese samples. Paper presented at the 23rd International Congress of Applied Psychology, Madrid, Spain.
(1978). Development and standardization of the Woodcock-Johnson psycho-educational battery. New York: Teaching Resources.
(1993). An IRT approach to cross-language test equating and interpretation. European Jounral of Psychological Assessment, 9, 233– 241.
(1979). Best test design. Chicago: MESA.
(1983). Use of the three-parameter logistic model in the development of a standardized achievement test. In , Applications of item response theory (pp.123–141). Vancouver, BC: Educational Research Institute of British Columbia.
(1991). Developing a professional attitude scale for teachers school students. Proceedings of International Academic Symposium on Psychological Measurement, Peking, China.
(