A Mixed-Effects Model for Powerful Association Tests in Integrative Functional Genomics

作者:Su Yu Ru; Di C*******; Bien Stephanie; Huang Licai; Dong Xinyuan; Abecasis Goncalo; Berndt Sonja; Bezieau Stephane; Brenner Hermann; Caan Bette; Casey Graham; Chang Claude Jenny; Chanock Stephen; Chen Sai; Connolly Charles; Curtis Keith; Figueiredo Jane; Gala Manish; Gallinger Steven; Harrison Tabitha; Hoffmeister Michael; Hopper John; Huyghe Jeroen R; Jenkins Mark; Joshi Amit; Le Marchand Loic; Newcomb Polly; Nickerson Deborah; Potter John; Schoen Robert
来源:American Journal of Human Genetics, 2018, 102(5): 904-919.
DOI:10.1016/j.ajhg.2018.03.019

摘要

Genome-wide association studies (GWASs) have successfully identified thousands of genetic variants for many complex diseases; however, these variants explain only a small fraction of the heritability. Recently, genetic association studies that leverage external transcriptome data have received much attention and shown promise for discovering novel variants. One such approach, PrediXcan, is to use predicted gene expression through genetic regulation. However, there are limitations in this approach. The predicted gene expression may be biased, resulting from regularized regression applied to moderately sample-sized reference studies. Further, some variants can individually influence disease risk through alternative functional mechanisms besides expression. Thus, testing only the association of predicted gene expression as proposed in PrediXcan will potentially lose power. To tackle these challenges, we consider a unified mixed effects model that formulates the association of intermediate phenotypes such as imputed gene expression through fixed effects, while allowing residual effects of individual variants to be random. We consider a set-based score testing framework, MiST (mixed effects score test), and propose two data-driven combination approaches to jointly test for the fixed and random effects. We establish the asymptotic distributions, which enable rapid calculation of p values for genome-wide analyses, and provide p values for fixed and random effects separately to enhance interpretability over GWASs. Extensive simulations demonstrate that our approaches are more powerful than existing ones. We apply our approach to a large-scale GWAS of colorectal cancer and identify two genes, POU5FIB and ATF1, which would have otherwise been missed by PrediXcan, after adjusting for all known loci.

  • 出版日期2018-5-3