A Pipeline for Classifying Relationships Using Dense SNP/SNV Data and Putative Pedigree Information

Zeng Zhen<sup>*</sup>; Weeks Daniel E; Chen Wei; Mukhopadhyay Nandita; Feingold Eleanor<sup>*</sup>

doi:10.1002/gepi.21948

摘要

When genome-wide association studies (GWAS) or sequencing studies are performed on family-based datasets, the genotype data can be used to check the structure of putative pedigrees. Even in datasets of putatively unrelated people, close relationships can often be detected using dense single-nucleotide polymorphism/variant (SNP/SNV) data. A number of methods for finding relationships using dense genetic data exist, but they all have certain limitations, including that they typically use average genetic sharing, which is only a subset of the available information. Here, we present a set of approaches for classifying relationships in GWAS datasets or large-scale sequencing datasets. We first propose an empirical method for detecting identity by descent segments in close relative pairs using un-phased dense SNP data and demonstrate how that information can assist in building a relationship classifier. We then develop a strategy to take advantage of putative pedigree information to enhance classification accuracy. Our methods are tested and illustrated with two datasets from two distinct populations. Finally, we propose classification pipelines for checking and identifying relationships in datasets containing a large number of small pedigrees. Genet Epidemiol 40: 161-171, 2016.

出版日期2016-2

全文

访问全文

收藏分享被引(2) 浏览

更新时间：2021-04-13 22:51

A Pipeline for Classifying Relationships Using Dense SNP/SNV Data and Putative Pedigree Information

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友