Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat

作者:Ranwez Vincent*; Holtz Yan; Sarah Gautier; Ardisson Morgane; Santoni Sylvain; Glemin Sylvain; Tavaud Pirra Muriel; David Jacques
来源:BMC Bioinformatics, 2013, 14(Suppl 15): S15.
DOI:10.1186/1471-2105-14-S15-S15

摘要

Background: Using Next Generation Sequencing, SNP discovery is relatively easy on diploid species and still hampered in polyploid species by the confusion due to homeology. We develop HomeoSplitter; a fast and effective solution to split original contigs obtained by RNAseq into two homeologous sequences. It uses the differential expression of the two homeologous genes in the RNA. We verify that the new sequences are closer to the diploid progenitors of the allopolyploid species than the original contig. By remapping original reads on these new sequences, we also verify that the number of valuable detected SNPs has significantly increased. Thirty accessions of the tetraploid durum wheat (Triticum turgidum, A and B genomes) were sequenced in pooled cDNA libraries. Reads were assembled in a de novo durum assembly. Transcriptomes of the diploid species, Aegilops speltoides (close B genome) and Triticum urartu (A genome) were used as reference to benchmark the method. Results: HomeoSplitter is a fast and effective solution to disentangle homeologous sequences based on a maximum likelihood optimization. On a benchmark set of 2,505 clusters containing homologous sequences of urartu, speltoides and durum, HomeoSplitter was efficient to build sequences closer to the diploid references and increased the number of valuable SNPs from 188 out of 1,360 SNPs detected when mapping the reads on the de novo durum assembly to 762 out of 1,620 SNPs when mapping on HomeoSplitter contigs. Conclusions: The HomeoSplitter program is freely available at http://bioweb.supagro.inra.fr/homeoSplitter/. This work provides a practical solution to the complex problem of disentangling homeologous transcripts in allo-tetraploids, which further allows an improved SNP detection.

  • 出版日期2013-10-15

全文