Testing Practices in the 21st Century
Developments and European Psychologists’ Opinions
Abstract
The main goal of the European Federation of Psychologists’Associations (EFPA) Standing Committee on Tests and Testing (SCTT) is the improvement of testing practices in European countries. In order to reach this goal, the SCTT carries out various actions and projects, some of which are described in this paper. To better inform its work, it decided to survey the opinions of professional psychologists on testing practices. A questionnaire of 33 items was administered to a sample of 12,606 professional psychologists from 17 European countries. The questionnaire was based on, but not identical to, one used in 2000. The new data show that the positive attitude of the respondents toward the use of tests that was obtained in 2000 has increased in most countries, with a high percentage of the surveyed psychologists using tests regularly. Five main dimensions explained 43% of the total item variance. The dimensions involve items relating to: Concern over incorrect test use, regulations on tests and testing, Internet testing, appreciation of tests, and knowledge and training relating to tests and test use. Important differences between countries were found on these five dimensions. Differences were found according to gender for four of the five dimensions and in relation to field of specialization for all five dimensions. The most commonly used tests are the classic psychometric tests of intelligence and personality: WISC, WAIS, MMPI, RAVEN, 16PF, NEO-PI-R, BDI, SCL-90. Finally, some future perspectives are discussed.
References
1991). Psychological testing and assessment (7th ed.). Boston, MA: Allyn and Bacon.
(1999). Standards for educational and psychological testing. Washington, DC: American Psychological Association.
. (2009). Comparison of methods for controlling maximum exposure rates in computerized adaptive testing. Psicothema, 21, 313–320.
(1996). Test qualifications and test use in the UK: The competence approach. European Journal of Psychological Assessment, 12, 62–71.
(1998). The need for international guidelines on standards for test use: A review of European and international initiatives. European Psychologist, 2, 155–163.
(2011). Contributions of the EFPA Standing Committee on Tests and Testing (SCTT) to standards and good practice. European Psychologist, 16, 149–159.
(1998). Variations in national patterns of testing and test use: The ITC/EFPPA international survey. European Journal of Psychological Assessment, 14, 249–260.
(2006). Computer-based testing and the Internet. Chichester, UK: Wiley.
(2005). Definition and assessment of competences in the context of the European diploma in psychology. European Psychologist, 10, 93–102.
(1999). Using new technology to improve assessment. Educational Measurement: Issues and Practice., 18, 5–12.
(2006). Inexorable and inevitable: The continuing story of technology and assessment. In , Computer-based testing and the Internet (pp. 201–217). Chichester, UK: Wiley.
(2009, July). An ISO standard for assessment in work and organizational settings. In , International guidelines and standards relating to tests and testing. Symposium conducted at the 11th European Congress of Psychology, Oslo, Norway.
(2006). Facing the opportunities of the future. In , Computer-based testing and the Internet (pp. 219–251). Chichester, UK: Wiley.
(2006). Educational measurement. Westport, CT: ACE/Praeger.
(2011). Item response modelling of forced-choice questionnaires. Educational and Psychological Measurement, 71, 460–502.
(2009). A critical analysis of cross-cultural research and testing practices: Implications for improved education and training in psychology. Training and Education in Professional Psychology, 3, 94–105.
(1988). Statistical power analysis for the behavioral sciences (2nd ed.). Hillsdale, NJ: Erlbaum.
(1951). Coefficient alpha and the internal structure of tests. Psychometrika, 16, 297–334.
(2006). Handbook of test development. Hillsdale, NJ: Erlbaum.
(2006). Technology and testing. In , Educational measurement (pp. 471–515). Westport, CT: ACE/Praeger.
(2010). Testbruk i Norge
([Test use in Norway] . In , Klinisk nevropsykologi. Undersøkelse av voksne pasienter (pp. 51–65). Trondheim, Norway: Tapir Academic Press.2005). Meta-code of ethics. Brussels, Belgium Author (www.efpa.eu)
. (2001a). Improving test quality in the Netherlands: Results of 18 years of test ratings. International Journal of Testing, 1, 137–153.
(2001b). The revised Dutch rating system for test quality. International Journal of Testing, 1, 155–182.
(2009). COTAN Beoordelingssysteem voor de Kwaliteit van Tests (geheel herziene versie)
([COTAN Rating system for test quality (completely revised edition)] . Amsterdam: NIP.2002). Ontwikkelingen in het testgebruik van Nederlandse psychologen
([Developments in test use of Dutch psychologists] . De Psycholoog, 37, 54–61.1988). Test user qualifications: A data-based approach to promoting good test use. Issues in Scientific Psychology. Washington, DC: American Psychological Association.
(1993). Responsible test use. Case studies for assessing human behavior. Washington, DC: American Psychological Association.
(2001). Guidelines for the assessment process (GAP): A proposal for discussion. European Journal of Psychological Assessment, 17, 187–200.
(2004). Student test score reports and interpretive guides: Review of current practices and suggestions for future research. Applied Measurement in Education, 17, 145–220.
(1978). Fundamental statistics in psychology and education (6th ed.). Tokyo, Japan: McGraw-Hill Kogakusha.
(2004). Theory, methods, and practices in testing for the 21st century. Psicothema, 16, 696–701.
(2006, March). Testing practices in the 21st century. Key Note Address. Spain: University of Oviedo.
(2005). Adapting educational and psychological tests for cross-cultural assessment. London, UK: Erlbaum.
(2005). The ITC guidelines on computer-based and Internet-delivered testing. Downloaded electronically on March 4, 2010, from www.intestcom.org/itc_projects.htm
. (2002). Item generation for test development. Mahwah, NJ: Erlbaum.
. (2002). Ethical principles of psychologists and code of conduct. Washington, DC: Author.
(2007). Ethics in psychology. New York: Oxford University Press.
(2008). Psykologisten testien käyttö Suomessa. Testaamisen määrä ja yleisimmät testit
([The use of psychological tests in Finland. The volume of usage and the most popular tests] . Retrieved from www.testilautakunta.fi/Artikkeli.pdf2007). Ethics standards impacting test development and use: A review of 31 ethics codes impacting practices in 35 countries. International Journal of Testing, 7, 71–88.
(2006). The mode effect: A literature review of human and technological issues in computerized testing. International Journal of Testing, 6, 1–24.
(2008). Ethics for European psychologists. Göttingen, Germany and Cambridge, MA: Hogrefe.
(2005). The implications of the “Bologna process” for the development of a European qualification in psychology. European Psychologist, 10, 86–92.
(2010). Item response modelling of paired-comparison and ranking data. Multivariate Behavioural Research, 45, 935–974.
(2002). Computer-based testing: Building the foundation for future assessments. Hillsdale, NJ: Erlbaum.
(1995). Assessment of test user qualifications. American Psychologist, 5, 14–23.
(2007). Improving international tests and testing. European Psychologist, 12, 206–219.
(2001). Testing practices in European countries. European Journal of Psychological Assessment, 17, 201–211.
(2000). La utilización de los tests en España
([Test use in Spain] . Papeles del Psicólogo, 76, 41–49.2008). Construcción de instrumentos de medida para la evaluación universitaria
([Development of measurement instruments for college assessment] . Revista de Investigación en Educación, 5, 13–25.1999). Test use in Spain, Portugal and Latin American countries. European Journal of Psychological Assessment, 15, 151–157.
(1999). Tests informatizados: fundamentos y aplicaciones
([Computer-based tests: Foundations and applications] . Madrid, Spain: Pirámide.2010). Innovative items for computerized testing. In , Elements of adaptive testing (pp. 215–230). London, UK: Springer.
(2002). Practical considerations in computer-based testing. New York: Springer.
(2003). La enseñanza de la Psicología en Europa. Un proyecto de Titulación Europea
([Teaching psychology in Europe: A project of European qualification] . Papeles del Psicólogo, 86, 25–33.2005). Defending standardized testing. London, UK: Erlbaum.
(2008). Correcting fallacies about educational and psychological testing. Washington, DC: American Psychological Association.
(2000). Un modelo para evaluar la calidad de los tests utilizados en España
([A model to assess the quality of tests used in Spain] . Papeles del Psicólogo, 77, 65–71.2003). Automated essay scoring. London, UK: Erlbaum.
(1996). Recommendations by the Canadian Psychological Association for improving the North American safeguards that help protect the public against test misuse. European Journal of Psychological Assessment, 12, 72–82.
(2006). Innovative item formats in computer-based testing: In pursuit of construct representation. In , Handbook of test development. Hillsdale, NJ: Erlbaum.
(2009). Goodness of fit in polytomous items: Type I error rates and empirical power for three fit indexes. Psicothema, 21, 639–645.
(2011). Technology-enhanced assessment of talent. San Francisco, CA: Josey-Bass.
(2010). Elements of adaptive testing. London, UK: Springer.
(2005). Constructing measures: An item response modeling approach. Mahwah, NJ: Erlbaum.
(2002). Technological innovations in large-scale assessment. Applied Measurement in Education, 15, 337–362.
(