A new statistic and its power to infer membership in a genome-wide association study using genotype frequencies

Authors: Jacobs Kevin B*; Yeager Meredith; Wacholder Sholom; Craig David; Kraft Peter; Hunter David J; Paschal Justin; Manolio Teri A; Tucker Margaret; Hoover Robert N; Thomas Gilles D; Chanock Stephen J; Chatterjee Nilanjan
Source: Nature Genetics, 2009, 41(11): 1253-U126.
DOI: 10.1038/ng.455

Abstract

Aggregate results from genome-wide association studies (GWAS)(1-3), such as genotype frequencies for cases and controls, were until recently often made available on public websites(4,5) because they were thought to disclose negligible information concerning an individual's participation in a study. Homer et al.(6) recently suggested that a method for forensic detection of an individual's contribution to an admixed DNA sample could be applied to aggregate GWAS data. Using a likelihood-based statistical framework, we developed an improved statistic that uses genotype frequencies and individual genotypes to infer whether a specific individual or any close relatives participated in the GWAS and, if so, what the participant's phenotype status is. Our statistic compares the logarithm of genotype frequencies, in contrast to that of Homer et al.(6), which is based on differences in either SNP probe intensity or allele frequencies. We derive the theoretical power of our test statistics and explore the empirical performance in scenarios with varying numbers of randomly chosen or top-associated SNPs.
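The idea of comparing the logarithm of genotype frequencies can be illustrated with a minimal sketch. This is not the authors' statistic, whose exact form the abstract does not give; it is a generic log-frequency-ratio score written under stated assumptions. The function name `log_frequency_ratio_score`, the `floor` parameter, the 0/1/2 genotype coding, and all frequencies below are hypothetical and chosen only for illustration.

```python
import math

def log_frequency_ratio_score(genotypes, study_freqs, reference_freqs, floor=1e-6):
    """Sum, over SNPs, of log(study frequency) - log(reference frequency)
    evaluated at the individual's observed genotype.

    genotypes       : one genotype code per SNP (0, 1, or 2 copies of an allele)
    study_freqs     : per-SNP tuples of genotype frequencies in the study group
    reference_freqs : per-SNP tuples of genotype frequencies in a reference group
    floor           : assumed lower bound that avoids log(0) for unseen genotypes
    """
    score = 0.0
    for g, study, ref in zip(genotypes, study_freqs, reference_freqs):
        p_study = max(study[g], floor)
        p_ref = max(ref[g], floor)
        score += math.log(p_study) - math.log(p_ref)
    return score

# Toy data for three SNPs (made up, not taken from any study).
genotypes = [0, 1, 2]
study_freqs = [(0.50, 0.40, 0.10), (0.30, 0.50, 0.20), (0.60, 0.30, 0.10)]
reference_freqs = [(0.49, 0.42, 0.09), (0.36, 0.48, 0.16), (0.64, 0.32, 0.04)]

score = log_frequency_ratio_score(genotypes, study_freqs, reference_freqs)
print(f"log-frequency-ratio score: {score:.4f}")
```

In this sketch, a positive score means the individual's genotypes are more probable under the study group's frequencies than under the reference frequencies. Turning such a score into a formal membership test would require a null distribution and a power analysis, which the abstract says the authors derive but which this example does not attempt.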

  • Publication date: 2009-11