Abstract | ||
---|---|---|
We study a method of seed-based lossless filtration for approximate string matching and related applications. The method is based on a simultaneous use of several spaced seeds rather than a single seed as studied by Burkhardt and Karkkainen (1). We present algorithms to com- pute several important parameters of seed families, study their combinatorial properties, and describe several techniques to construct efficient families. We also report a large-scale application of the proposed technique to the problem of oligonucleotide selection for an EST sequence database. |
Year | DOI | Venue |
---|---|---|
2004 | 10.1007/978-3-540-27801-6_22 | CPM |
Keywords | Field | DocType |
approximate string matching | Sequence database,Computer science,Algorithm,Theoretical computer science,Filtration,Approximate string matching,Pattern matching,Lossless compression | Conference |
Volume | Citations | PageRank |
3109 | 2 | 0.47 |
References | Authors | |
15 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Gregory Kucherov | 1 | 1003 | 74.54 |
Laurent Noé | 2 | 230 | 13.94 |
Mikhail A. Roytberg | 3 | 114 | 54.66 |