Original Article

Incentives and Alternative Rating Approaches

Roads to Greater Accuracy in Job Performance Assessment?

Published Online: https://doi.org/10.1027/1866-5888/a000068

Ratings of others’ performance are central in applied psychology. We investigated the effects of rater incentives and two approaches to Behavior Observation Scale (BOS) rating on rating accuracy. Raters (N = 147) were randomly assigned to one of three accuracy-incentive conditions and completed one of two BOSs 48 hr after observing videotaped performances. A serial BOS asked raters to assess one ratee at a time, across behaviors. A parallel BOS had raters consider all ratees on one behavior at a time. The serial (one ratee at a time) approach was generally more accurate than the parallel (one behavior at a time) approach. However, an accuracy incentive prior to observation mitigated the negative effects of the parallel approach. Overall, a serial BOS seems well suited for developmental appraisal.
