Using cascading Bloom filters to improve the memory usage for de Brujin graphs

Salikhov Kamil; Sacomoto Gustavo; Kucherov Gregory<sup>*</sup>

doi:10.1186/1748-7188-9-2

摘要

Background: De Brujin graphs are widely used in bioinformatics for processing next-generation sequencing data. Due to a very large size of NGS datasets, it is essential to represent de Bruijn graphs compactly, and several approaches to this problem have been proposed recently. %26lt;br%26gt;Results: In this work, we show how to reduce the memory required by the data structure of Chikhi and Rizk (WABI%26apos; 12) that represents de Brujin graphs using Bloom filters. Our method requires 30% to 40% less memory with respect to their method, with insignificant impact on construction time. At the same time, our experiments showed a better query time compared to the method of Chikhi and Rizk. %26lt;br%26gt;Conclusion: The proposed data structure constitutes, to our knowledge, currently the most efficient practical representation of de Bruijn graphs.

出版日期2014-2-24

全文

访问全文

收藏分享被引(46) 浏览

更新时间：2024-05-05 19:35

Using cascading Bloom filters to improve the memory usage for de Brujin graphs

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友