MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph

Authors: Li, Dinghua; Liu, Chi-Man; Luo, Ruibang*; Sadakane, Kunihiko; Lam, Tak-Wah
Source: Bioinformatics, 2015, 31(10): 1674-1676.
DOI:10.1093/bioinformatics/btv033

Abstract

MEGAHIT is an NGS de novo assembler for assembling large and complex metagenomics data in a time- and cost-efficient manner. On a single computing node, it finished assembling a soil metagenomics dataset of 252 Gbps in 44.1 h with a graphics processing unit and in 99.6 h without one. MEGAHIT assembles the data as a whole, i.e. no pre-processing such as partitioning or normalization is needed. Compared with previous methods on the soil data, MEGAHIT generated an assembly three times larger, with a longer contig N50 and average contig length; furthermore, 55.8% of the reads aligned to the assembly, a fourfold improvement.
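
The abstract compares assemblies by contig N50 and average contig length. As a minimal sketch of how these two summary statistics are commonly computed (this is not code from MEGAHIT, and the function names and example lengths are hypothetical), N50 is taken here as the length of the contig at which the running total, counted from the longest contig down, first reaches half of the total assembly size:

```python
def n50(lengths):
    """Return the contig N50 for a list of contig lengths (in bp)."""
    ordered = sorted(lengths, reverse=True)  # longest contigs first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("no contigs given")


def mean_length(lengths):
    """Return the average contig length (in bp)."""
    return sum(lengths) / len(lengths)


# Hypothetical contig lengths, for illustration only.
contigs = [100, 200, 300, 400, 500]
print(n50(contigs), mean_length(contigs))
```

Because N50 weights contigs by the bases they contribute, a handful of long contigs raises it far more than many short ones do, which is why it is usually reported alongside the plain average.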

  • Publication date: 2015-05-15