Title
Constant-time and space-efficient unidirectional and bidirectional FM-indices using EPR-dictionaries.
Abstract
We introduce a new method for conducting an exact search in a uni- and bidirectional FM index in $mathcal{O}(1)$ time per step while using $mathcal{O}(log sigma cdot n) + o(log sigma cdot sigma cdot n)$ bits of space. This is done by replacing the wavelet tree by a new data structure, the emph{Enhanced Prefixsum Rank dictionary} (EPR-dictionary). To our knowledge this is the first constant time method for a search step in 2FM indices and a space improvement for FM indices. We implemented this method in the SeqAn C++ library and experimentally validated our theoretical results. In addition we compared our implementation with other freely available implementations of bidirectional indices and show that we are between $approx 2.7-4.6$ times faster. This will have a large impact for many bioinformatics applications that rely on practical implementations of (2)FM indices e.g. for read mapping.
Year
Venue
Field
2016
arXiv: Data Structures and Algorithms
Discrete mathematics,Data structure,Combinatorics,Approx,Electron paramagnetic resonance,Spacetime,Wavelet Tree,Sigma,FM-index,Mathematics
DocType
Volume
Citations 
Journal
abs/1608.02413
1
PageRank 
References 
Authors
0.36
0
3
Name
Order
Citations
PageRank
Christopher Pockrandt110.69
Marcel Ehrhardt210.36
Knut Reinert31020105.87