Artificial duplicate reads in sequencing data of 454 Genome Sequencer FLX System

Authors: Dong Hui; Chen Yangyi; Shen Yan; Wang Shengyue; Zhao Guoping*; Jin Weirong
Source: Acta Biochimica et Biophysica Sinica, 2011, 43(6): 496-500.
DOI: 10.1093/abbs/gmr030

Abstract

The 454 Genome Sequencer (GS) FLX System is a next-generation sequencing platform characterized by long reads, high accuracy, and ultra-high throughput. Given the mechanism of emulsion PCR, each unique DNA template should yield only one unique sequence read after being amplified and sequenced on the GS FLX. However, biased amplification of DNA templates can occur during emulsion PCR, producing artificial duplicate reads. Even when every DNA template was distinct from all others, 3.49%-18.14% of the total reads in GS FLX sequencing data were found to be artificial duplicates. These duplicate reads may lead to misinterpretation of sequencing data, and special attention should be paid to the biases they can introduce.
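To make the duplicate-rate figure concrete, the following is a minimal sketch of how a fraction of duplicate reads could be estimated from a list of read sequences. The abstract does not state the criterion the authors used to call a read an artificial duplicate, so the rule here is an assumption: reads count as duplicates when their sequences (or, optionally, their first `prefix_len` bases) are identical, and every copy beyond the first in such a group is counted. The function name `duplicate_fraction`, the `prefix_len` parameter, and the toy reads are all hypothetical.

```python
from collections import Counter


def duplicate_fraction(reads, prefix_len=None):
    """Return the fraction of reads that are extra copies of another read.

    Assumption (not from the paper): two reads are duplicates when their
    sequences are identical, or when their first `prefix_len` bases are
    identical if `prefix_len` is given.
    """
    # Reduce each read to the key used for comparison.
    keys = [r[:prefix_len] if prefix_len else r for r in reads]
    total = len(keys)
    if total == 0:
        return 0.0
    # One read per group is treated as the original; the rest are duplicates.
    groups = Counter(keys)
    duplicates = total - len(groups)
    return duplicates / total


# Toy data: three copies of one read plus two distinct reads.
reads = ["ACGTACGT", "ACGTACGT", "TTGACCA", "ACGTACGT", "GGCATTA"]
fraction = duplicate_fraction(reads)
print(f"duplicate fraction: {fraction:.2%}")
```

Exact-sequence matching is the simplest possible rule; with real data, sequencing errors and variable read lengths would likely call for a more tolerant comparison, such as matching on start position and a prefix.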

  • Publication date: 2011-6
  • Affiliation: Shanghai Human Genome Research Center