Using a priori knowledge to align sequencing reads to their exact genomic position

Authors: Bottcher Rene; Amberg Ronny; Ruzius F P; Guryev V; Verhaegh Wim F J; Beyerlein Peter; van der Zaag P J*
Source: Nucleic Acids Research, 2012, 40(16): e125.
DOI: 10.1093/nar/gks393

Abstract

The use of a priori knowledge in the alignment of targeted sequencing data is investigated using computational experiments. Adapting a Needleman-Wunsch algorithm to incorporate the genomic position information from the targeted capture, we demonstrate that alignment can be done to just the target region of interest. When in addition use is made of direct string comparison, an improvement of up to a factor of 8 in alignment speed compared to the fastest conventional aligner (Bowtie) is obtained. This results in a total alignment time in targeted sequencing of around 7 min for aligning approximately 56 million captured reads. For conventional aligners such as Bowtie, BWA or MAQ, alignment to just the target region is not feasible as experiments show that this leads to an additional 88% SNP calls, the vast majority of which are false positives (~92%).
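
The two-stage idea described in the abstract (try a direct string comparison first, then fall back to Needleman-Wunsch against the target region) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `nw_score` and `align_read`, the scoring values, and the assumption that each read arrives with a known expected offset into the target sequence are all hypothetical choices made for illustration.

```python
def nw_score(a, b, match=1, mismatch=-1, gap=-2):
    """Global Needleman-Wunsch alignment score of a against b.

    Scoring values are illustrative assumptions, not taken from the paper.
    Only two rows of the dynamic-programming matrix are kept in memory.
    """
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap] + [0] * len(b)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr[j] = max(diag, prev[j] + gap, curr[j - 1] + gap)
        prev = curr
    return prev[len(b)]


def align_read(read, target, expected_pos):
    """Align a read at its expected position in the target region.

    Hypothetical helper: expected_pos stands in for the a priori position
    information that the targeted capture provides.
    """
    window = target[expected_pos:expected_pos + len(read)]
    # Stage 1: direct string comparison, which avoids the DP entirely.
    if window == read:
        return ("exact", len(read))
    # Stage 2: fall back to Needleman-Wunsch against the same window.
    return ("needleman-wunsch", nw_score(read, window))


target = "ACGTACGTTAGC"
print(align_read("ACGTT", target, 4))  # identical to the window, no DP needed
print(align_read("ACCTT", target, 4))  # one mismatch, scored by the DP
```

In this sketch the cheap comparison handles reads that match the reference exactly, so the quadratic-time dynamic programming runs only for reads that contain differences, which is one plausible source of the speed-up the abstract reports.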

  • Publication date: 2012-9