How to score respondents? A Monte Carlo study comparing three different procedures

ABSTRACT This study aimed to empirically compare the effectiveness of Likert, Thurstonian, and Expected a Posteriori (EAP) scoring methods. A computational simulation of the two-parameter logistic model was employed under various conditions, including different sample sizes, number of items, scale l...

Full description

Saved in:
Bibliographic Details
Main Authors: Víthor Rosa Franco, Marie Wiberg
Format: Article
Language:Portuguese
Published: Instituto Brasileiro de Avaliação Psicológica (IBAP) 2025-06-01
Series:Avaliação Psicológica
Subjects:
Online Access:http://pepsic.bvsalud.org/scielo.php?script=sci_arttext&pid=S1677-04712025000100101&lng=en&tlng=en
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:ABSTRACT This study aimed to empirically compare the effectiveness of Likert, Thurstonian, and Expected a Posteriori (EAP) scoring methods. A computational simulation of the two-parameter logistic model was employed under various conditions, including different sample sizes, number of items, scale levels, item extremeness, and varying discriminations. Effectiveness was assessed through correlation with the true score, root mean squared errors, bias, and the accurate recovery of effect sizes. The results indicated that Likert scores exhibit greater bias than EAP and Thurstonian scores for extreme scores, however, they strongly correlate with both the true score and EAP scores. Likert scores were slightly more effective in recovering mean differences between two groups, correlation estimates, and regression parameters. Overall, Likert scores should be avoided when ordering or thresholding individuals at the extremes of scales is the primary objective. However, they are preferable in situations where Thurstonian and EAP scores may fail to converge. The study also recommends that future research explore conditions involving more complex data-generating processes.
ISSN:2175-3431