TY - JOUR
T1 - A Response to Holster and Lake Regarding Guessing and the Rasch Model
AU - Stewart, Jeffrey
AU - McLean, Stuart
AU - Kramer, Brandon
N1 - Publisher Copyright:
© 2017 Taylor & Francis.
PY - 2017/1/2
Y1 - 2017/1/2
N2 - Stewart questioned vocabulary size estimation methods proposed by Beglar and Nation for the Vocabulary Size Test, further arguing Rasch mean square (MSQ) fit statistics cannot determine the proportion of random guesses contained in the average learner’s raw score, because the average value will be near 1 by design. He illustrated this by demonstrating this is true even of entirely random data. Holster and Lake appear to misinterpret this as a claim that Rasch analyses cannot distinguish random data from real responses. To test this, they compare real data to random and note that, predictably, the statistic easily distinguishes the two and that reliability for random data is near zero. However, while certainly true, this fact is not relevant to Stewart’s argument that multiple-choice options inflate the test’s size estimates and that MSQ fit statistics cannot be used to detect this. We further illustrate this by showing real data retains average MSQ values near 1, even when unknown items skipped by learners are imputed with random guesses. Furthermore, the imputed data do not exhibit “problematic guessing” under Holster & Lake’s own criteria, despite size inflation under Beglar and Nation’s suggested scoring. We conclude by discussing uses of the 3PL model.
AB - Stewart questioned vocabulary size estimation methods proposed by Beglar and Nation for the Vocabulary Size Test, further arguing Rasch mean square (MSQ) fit statistics cannot determine the proportion of random guesses contained in the average learner’s raw score, because the average value will be near 1 by design. He illustrated this by demonstrating this is true even of entirely random data. Holster and Lake appear to misinterpret this as a claim that Rasch analyses cannot distinguish random data from real responses. To test this, they compare real data to random and note that, predictably, the statistic easily distinguishes the two and that reliability for random data is near zero. However, while certainly true, this fact is not relevant to Stewart’s argument that multiple-choice options inflate the test’s size estimates and that MSQ fit statistics cannot be used to detect this. We further illustrate this by showing real data retains average MSQ values near 1, even when unknown items skipped by learners are imputed with random guesses. Furthermore, the imputed data do not exhibit “problematic guessing” under Holster & Lake’s own criteria, despite size inflation under Beglar and Nation’s suggested scoring. We conclude by discussing uses of the 3PL model.
UR - https://www.scopus.com/pages/publications/85009959394
U2 - 10.1080/15434303.2016.1262377
DO - 10.1080/15434303.2016.1262377
M3 - Comment/debate
AN - SCOPUS:85009959394
SN - 1543-4303
VL - 14
SP - 69
EP - 74
JO - Language Assessment Quarterly
JF - Language Assessment Quarterly
IS - 1
ER -