Title
Practical and Efficient Algorithms for Degenerate and Weighted Sequences Derived from High Throughput Sequencing Technologies
Abstract
High throughput (or next generation) sequencing technologies have opened new and exciting opportunities in the use of DNA sequences. These emerging technologies mark the beginning of a new era of high throughput short read sequencing: they have the potential to assemble a bacterial genome in a single experiment and at moderate cost. In this paper, we address the problem of efficiently mapping millions of degenerate and weighted sequences to a reference genome, determining whether each sequence occurs exactly once in the genome and taking probability scores into consideration. In particular, we define and solve the Massive Exact and Approximate Unique Pattern Matching problem for degenerate and weighted sequences derived from high throughput sequencing technologies.
Year
2009
DOI
10.1109/IJCBS.2009.48
Venue
IJCBS
Keywords
moderate cost,dna sequence,high throughput,weighted sequences,weighted sequence,exciting opportunity,bacterial genome,high throughput sequencing technologies,sequencing technology,new era,approximate unique pattern matching,reference genome,efficient algorithms,pattern matching,dna,genomics,next generation sequencing,dna sequences,molecular biophysics,probability,microorganisms,bioinformatics,emerging technology
Field
Genome,Hybrid genome assembly,Computer science,Algorithm,Genomics,DNA sequencing,Bioinformatics,Throughput,Pattern matching,Reference genome,Bacterial genome size
DocType
Conference
Citations
0
PageRank
0.34
References
8
Authors
4
Name, Order, Citations, PageRank
Pavlos Antoniou, 1, 353, 17.04
Costas S. Iliopoulos, 2, 1534, 167.43
Laurent Mouchard, 3, 251, 25.07
Solon P. Pissis, 4, 281, 57.09