Accuracy of Gene Scores when Pruning Markers by Linkage Disequilibrium

Dudbridge Frank; Newcombe Paul J

doi:10.1159/000446581

摘要

Objective: Gene scores are often used to model the combined effects of genetic variants. When variants are in linkage disequilibrium, it is common to prune all variants except the most strongly associated. This avoids duplicating information but discards information when variants have independent effects. However, joint modelling of correlated variants increases the sampling error in the gene score. In recent applications, joint modelling has offered only small improvements in accuracy over pruning. We aimed to quantify the relationship between pruning and joint modelling in relation to sample size. Methods: We derived the coefficient of determination R-2 for a gene score constructed from pruned markers, and for one constructed from correlated markers with jointly estimated effects. Results: Pruned scores tend to have slightly lower R2 than jointly modelled scores, but the differences are small at sample sizes up to 100,000. If the proportion of correlated variants is high, joint modelling can obtain modest improvements asymptotically. Conclusions: The small gains observed to date from joint modelling can be explained by sample size. As studies become larger, joint modelling will be useful for traits affected by many correlated variants, but the improvements may remain small. Pruning remains a useful heuristic for current studies.

出版日期2015

全文

访问全文

收藏分享被引(5) 浏览

更新时间：2021-04-09 04:56

Accuracy of Gene Scores when Pruning Markers by Linkage Disequilibrium

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友