Title
Indexing finite language representation of population genotypes
Abstract
We propose a way to index population genotype information together with the complete genome sequence, so that the index can be used to efficiently align a given sequence to the genome with all plausible genotype recombinations taken into account. This is achieved by converting a multiple alignment of individual genomes into a finite automaton that recognizes all strings that can be read from the alignment by switching the sequence at any time. The finite automaton is indexed with an extension of the Burrows-Wheeler transform to allow pattern search inside the plausible recombinant sequences. The size of the index stays limited because of the high similarity of individual genomes. The index finds applications in variation calling and in primer design.
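The conversion from a multiple alignment to an automaton can be illustrated with a small sketch. This is not the authors' construction or their index: it is one simple reading of the abstract, assuming that equal characters in the same alignment column are merged into a single node and that gaps (written `-`) are skipped. The names `build_automaton` and `occurs`, and the toy alignment, are hypothetical. The pattern search here is a naive graph walk, standing in for the Burrows-Wheeler-based index that the paper describes.

```python
from collections import defaultdict


def build_automaton(alignment, gap="-"):
    """Build an automaton from equal-length aligned rows.

    Assumption for illustration: a node is a (column, character) pair, so
    equal characters in the same column are shared by all rows. An edge links
    two nodes that are consecutive in at least one row once gaps are skipped.
    """
    width = len(alignment[0])
    if any(len(row) != width for row in alignment):
        raise ValueError("all aligned rows must have the same length")
    start, end = (-1, "^"), (width, "$")  # artificial source and sink nodes
    edges = defaultdict(set)
    for row in alignment:
        prev = start
        for col, ch in enumerate(row):
            if ch == gap:
                continue  # gaps contribute no node
            node = (col, ch)
            edges[prev].add(node)
            prev = node
        edges[prev].add(end)
    return dict(edges)


def occurs(edges, pattern):
    """Return True if `pattern` labels some path (naive search, no index)."""
    if not pattern:
        return True
    nodes = set(edges) | {m for succ in edges.values() for m in succ}
    frontier = {n for n in nodes if n[1] == pattern[0]}
    for ch in pattern[1:]:
        # Advance every active node along edges whose target carries `ch`.
        frontier = {m for n in frontier for m in edges.get(n, ()) if m[1] == ch}
        if not frontier:
            return False
    return bool(frontier)


alignment = ["ACGTA", "AC-TA", "ATGTC"]
edges = build_automaton(alignment)
print(occurs(edges, "ACGTC"))  # True: a recombinant of rows 1 and 3
print(occurs(edges, "GG"))     # False: no path spells GG
```

In this toy alignment, `ACGTC` appears in none of the three rows, yet the automaton accepts it because the path can move from the first row to the third at the shared `T` in column 3. Merging nodes by column and character is what keeps such a structure small when the genomes are highly similar.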
Year
2011
DOI
10.1007/978-3-642-23038-7_23
Venue
Workshop on Algorithms in Bioinformatics
Keywords
DNA sequence, finite automaton, pattern search, quantitative method, indexation, Burrows-Wheeler transform, data structure, multiple alignment
DocType
Conference
Volume
abs/1010.2656
Citations
3
PageRank
0.44
References
13
Authors (3)
Name             Order   Citations   PageRank
Jouni Sirén      1       222         14.85
Niko Välimäki    2       276         14.53
Veli Mäkinen     3       1583        85.29