A latent variable model approach to estimating systematic bias in the oversampling method

Hauner Katherina K*; Zinbarg Richard E; Revelle William

doi:10.3758/s13428-013-0402-6

Abstract

The method of oversampling data from a preselected range of a variable's distribution is often applied by researchers who wish to study rare outcomes without substantially increasing sample size. Despite frequent use, however, it is not known whether this method introduces statistical bias due to disproportionate representation of a particular range of data. The present study employed simulated data sets to examine how oversampling introduces systematic bias in effect size estimates (of the relationship between oversampled predictor variables and the outcome variable), as compared with estimates based on a random sample. In general, results indicated that increased oversampling was associated with a decrease in the absolute value of effect size estimates. Critically, however, the actual magnitude of this decrease in effect size estimates was nominal. This finding thus provides the first evidence that the use of the oversampling method does not systematically bias results to a degree that would typically impact results in behavioral research. Examining the effect of sample size on oversampling yielded an additional important finding: For smaller samples, the use of oversampling may be necessary to avoid spuriously inflated effect sizes, which can arise when the number of predictor variables and rare outcomes is comparable.

Publication date: 2014-9
Affiliation: Northwestern University
