Title
New algorithms for fixed-length approximate string matching and approximate circular string matching under the Hamming distance.
Abstract
This paper proposes new algorithms for fixed-length approximate string matching and approximate circular string matching under the Hamming distance. Fixed-length approximate string matching and approximate circular string matching are special cases of approximate string matching and have numerous direct applications in bioinformatics and text searching. Firstly, a counter-vector-mismatches (CVM) algorithm is proposed to solve fixed-length approximate string matching with -mismatches. The development of CVM algorithm is based on the parallel summation of counters located in the same machine word. Secondly, a parallel counter-vector-mismatches (PCVM) algorithm is proposed to accelerate CVM algorithm in parallel. The PCVM algorithm is integrated into two-level parallelisms that exploit not only word-level parallelism but also data parallelism via parallel environments such as multi-core processors and graphics processing units (GPUs). In the particular case of adopting GPUs, a shared-mem parallel counter-vector-mismatches (PCVMsmem) scheme can be implemented from PCVM algorithm. The PCVMsmem scheme can exploit the memory model of GPUs to optimize performance of PCVM algorithm. Finally, this paper shows several methods to adopt CVM and PCVM algorithms in case the input pattern is in circular structure. In the experiments with real DNA packages, our proposed algorithms and scheme work greatly faster than previous bit-vector-mismatches and parallel bit-vector-mismatches algorithms.
Year
DOI
Venue
2018
https://doi.org/10.1007/s11227-017-2192-6
The Journal of Supercomputing
Keywords
Field
DocType
Approximate string matching,Circular string,Hamming distance,Graphics processing units,Parallel computing
Computer science,Algorithm,Circular string,Hamming distance,Approximate string matching
Journal
Volume
Issue
ISSN
74
5
0920-8542
Citations 
PageRank 
References 
0
0.34
0
Authors
3
Name
Order
Citations
PageRank
ThienLuan Ho101.69
Seungrohk Oh2416.04
Hyunjin Kim3424.84