ASGART: fast and parallel genome scale segmental duplications mapping

作者:Delehelle Franklin; Cussat Blanc Sylvain; Alliot Jean Marc; Luga Herve Prime; Balaresque Patricia*
来源:Bioinformatics, 2018, 34(16): 2708-2714.
DOI:10.1093/bioinformatics/bty172

摘要

Motivation: Segmental Duplications (SDs) are DNA fragments longer than 1 kbp, distributed within and between chromosomes and sharing more than 90% identity. Although they hold a significant role in genomic fluidity and adaptability, many key questions about their intrinsic characteristics and mutability remain unsolved due to the persistent difficulty of sequencing highly duplicated genomic regions. The recent development of long and linked-read NGS technologies will increase the need to search for SDs in genomes newly sequenced with these technics. The main limitation of SD analysis will soon be the availability of efficient detection software, to retrieve and compare SD genomic component between species or lineages.
Results: In this paper, we present the open-source ASGART, 'A Segmental duplications Gathering And Refining Tool', developed to search for segmental duplications (SDs) in any assembled sequence. We have tested and benchmarked ASGART on five models organisms. Our results demonstrate ASGART's ability to extract SDs from any genome-wide sequence, regardless of genomic size or organizational complexity and quicker than any other software available.
Availability and implementation: The online version of ASGART is available at http://asgart.irit.fr. The source code of ASGART is available both on the ASGART website and at https://github.com/delehef/asgart.
Contact: patricia.balaresque@ univ-tlse3.fr
Supplementary information: Supplementary data are available at Bioinformatics online.

  • 出版日期2018-8-15