A New Genotype Imputation Method with Tolerance to High Missing Rate and Rare Variants

作者:Yang, Yumei; Wang, Qishan; Chen, Qiang; Liao, Rongrong; Zhang, Xiangzhe; Yang, Hongjie; Zheng, Youmin; Zhang, Zhiwu*; Pan, Yuchun
来源:PLos One, 2014, 9(6): e101025.
DOI:10.1371/journal.pone.0101025

摘要

We report a novel algorithm, iBLUP, to impute missing genotypes by simultaneously and comprehensively using identity by descent and linkage disequilibrium information. The simulation studies showed that the algorithm exhibited drastically tolerance to high missing rate, especially for rare variants than other common imputation methods, e. g. BEAGLE and fastPHASE. At a missing rate of 70%, the accuracy of BEAGLE and fastPHASE dropped to 0.82 and 0.74 respectively while iBLUP retained an accuracy of 0.95. For minor allele, the accuracy of BEAGLE and fastPHASE decreased to -0.1 and 0.03, while iBLUP still had an accuracy of 0.61. We implemented the algorithm in a publicly available software package also named iBLUP. The application of iBLUP for processing real sequencing data in an outbred pig population was demonstrated.