Title
Fast matching method for DNA sequences
Abstract
DNA sequences are the fundamental information for each species and a comparison between DNA sequences of different species is an important task. Since DNA sequences are very long and there exist many species, not only fast matching but also efficient storage is an important factor for DNA sequences. Thus, a fast string matching method suitable for encoded DNA sequences is needed. In this paper, we present a fast string matching method for encoded DNA sequences which does not decode DNA sequences while matching. We use four-characters-to-one-byte encoding and combine a suffix approach and a multipattern matching approach. Experimental results show that our method is about 5 times faster than AGREP and the fastest among known algorithms.
Year
DOI
Venue
2007
10.1007/978-3-540-74450-4_25
ESCAPE
Keywords
DocType
Volume
important task,fast string,encoded dna sequence,important factor,dna sequence,decode dna sequence,efficient storage,multipattern matching approach,suffix approach,different species,string matching,pattern matching
Conference
36
Issue
ISSN
ISBN
4
0302-9743
3-540-74449-5
Citations 
PageRank 
References 
7
0.63
21
Authors
3
Name
Order
Citations
PageRank
Jin Wook Kim16311.26
Eunsang Kim281.33
Kunsoo Park31396171.00