Using cascading Bloom filters to improve the memory usage for de Brujin graphs.

Paper Info

Title
Using cascading Bloom filters to improve the memory usage for de Brujin graphs.

Abstract
De Bruijn graphs are widely used in bioinformatics for processing next-generation sequencing (NGS) data. Due to the very large size of NGS datasets, it is essential to represent de Bruijn graphs compactly, and several approaches to this problem have been proposed recently. In this work, we show how to reduce the memory required by the data structure of Chikhi and Rizk (WABI'12) that represents de Bruijn graphs using Bloom filters. Our method requires 30% to 40% less memory than their method, with insignificant impact on construction time. At the same time, our experiments showed a better query time compared to the method of Chikhi and Rizk. The proposed data structure constitutes, to our knowledge, currently the most efficient practical representation of de Bruijn graphs.

Year	DOI	Venue
2014	10.1186/1748-7188-9-2	Algorithms for Molecular Biology
Keywords	Field	DocType
Next-generation sequencing, Genome assembly, de Bruijn graph, Bloom filter	Bloom filter, Graph, Computer science, Theoretical computer science, De Bruijn sequence, Bioinformatics	Journal
Volume	Issue	ISSN
9	1	1748-7188
Citations	PageRank	References
28	1.24	7
Authors
3

Authors (3 rows)

Name	Order	Citations	PageRank
Kamil Salikhov	1	33	2.69
Gustavo Sacomoto	2	45	5.81
Gregory Kucherov	3	1003	74.54
