摘要

An essential concern in the application of any equating procedure is determining whether tests can be considered equated after the tests have been placed onto a common scale. This article clarifies one equating criterion, the first-order equity property of equating, and develops a new method for evaluating equating that is linked to this criterion. The new approach involves graphically examining the difference in test characteristic curves (TCCs), calculating the maximum absolute difference between the TCCs, and comparing the differences in TCCs to the difference that matters (DTM). The new approach is applied to evaluate the equating of the Multi-state Bar Exam (MBE) for six different item response theory (IRT) scaling approaches in the common item nonequivalent groups design. The new approach is also contrasted with Tong and Kolen's (2005) index for assessing first-order equity. The empirical investigations indicated that the Stocking-Lord and fixed-parameter equating methods appear to perform the best for equating the MBE and that the use of concurrent calibration is not desirable.

  • 出版日期2011-5