Title
Efficiently detecting polymorphisms during the fragment assembly process.
Abstract
Current genomic sequence assemblers assume that the input data is derived from a single, homogeneous source. However, recent whole-genome shotgun sequencing projects have violated this assumption, resulting in input fragments covering the same region of the genome whose sequences differ due to polymorphic variation in the population. While single-nucleotide polymorphisms (SNPs) do not pose a significant problem to state-of-the-art assembly methods, these methods do not handle insertion/deletion (indel) polymorphisms of more than a few bases.This paper describes an efficient method for detecting sequence discrepencies due to polymorphism that avoids resorting to global use of more costly, less stringent affine sequence alignments. Instead, the algorithm uses graph-based methods to determine the small set of fragments involved in each polymorphism and performs more sophisticated alignments only among fragments in that set. Results from the incorporation of this method into the Celera Assembler are reported for the D. melanogaster, H. sapiens, and M. musculus genomes.
Year
DOI
Venue
2002
10.1093/bioinformatics/18.suppl_1.S294
ISMB
Keywords
Field
DocType
shotgun sequenc- ing,polymorphism.,contact: daniel.fasulo@celera.com keywords: whole-genome assembly,whole genome shotgun,polymorphism,nucleotides,sequence alignment,genome sequence
Affine transformation,Genome,Population,Shotgun sequencing,Computer science,Single-nucleotide polymorphism,Bioinformatics,Small set,Sequence assembly,Indel
Conference
Volume
ISSN
Citations 
18 Suppl 1
1367-4803
5
PageRank 
References 
Authors
0.58
1
4
Name
Order
Citations
PageRank
Daniel Fasulo1444.74
Aaron L. Halpern25514.04
Ian M. Dew32110.20
Clark M. Mobarry4339.26