Are Scores on English and French Versions of the PHQ-9 Comparable? An Assessment of Differential Item Functioning

作者:Arthurs Erin; Steele Russell J; Hudson Marie; Baron Murray; Thombs Brett D*
来源:PLos One, 2012, 7(12): e52028.
DOI:10.1371/journal.pone.0052028

摘要

Background: Medical research increasingly utilizes patient-reported outcome measures administered and scored in different languages. In order to pool or compare outcomes from different language versions, instruments should be measurement equivalent across linguistic groups. The objective of this study was to examine the cross-language measurement equivalence of the Patient Health Questionnaire-9 (PHQ-9) between English- and French-speaking Canadian patients with systemic sclerosis (SSc). Methods: The sample consisted of 739 English- and 221 French-speaking SSc patients. Multiple-Indicator Multiple-Cause (MIMIC) modeling was used to identify items displaying possible differential item functioning (DIF). Results: A one-factor model for the PHQ-9 fit the data well in both English- and French-speaking samples. Statistically significant DIF was found for 3 of 9 items on the PHQ-9. However, the overall estimate in depression latent scores between English- and French-speaking respondents was not influenced substantively by DIF. Conclusions: Although there were several PHQ-9 items with evidence of minor DIF, there was no evidence that these differences influenced overall scores meaningfully. The PHQ-9 can reasonably be used without adjustment in Canadian English- and French-speaking samples. Analyses assessing measurement equivalence should be routinely conducted prior to pooling data from English and French versions of patient-reported outcome measures. Citation: Arthurs E, Steele RJ, Hudson M, Baron M, Thombs BD, et al. (2012) Are Scores on English and French Versions of the PHQ-9 Comparable? An Assessment of Differential Item Functioning. PLoS ONE 7(12): e52028. doi: 10.1371/journal.pone.0052028