DSRC 2-Industry-oriented compression of FASTQ files

作者:Roguski Lukasz; Deorowicz Sebastian*
来源:Bioinformatics, 2014, 30(15): 2213-2215.
DOI:10.1093/bioinformatics/btu208

摘要

Modern sequencing platforms produce huge amounts of data. Archiving them raises major problems but is crucial for reproducibility of results, one of the most fundamental principles of science. The widely used gzip compressor, used for reduction of storage and transfer costs, is not a perfect solution, so a few specialized FASTQ compressors were proposed recently. Unfortunately, they are often impractical because of slow processing, lack of support for some variants of FASTQ files or instability. We propose DSRC 2 that offers compression ratios comparable with the best existing solutions, while being a few times faster and more flexible.

  • 出版日期2014-8-1