Title
SeqAnt: A web service to rapidly identify and annotate DNA sequence variations.
Abstract
BACKGROUND: The enormous throughput and low cost of second-generation sequencing platforms now allow research and clinical geneticists to routinely perform single experiments that identify tens of thousands to millions of variant sites. Existing methods to annotate variant sites using information from publicly available databases via web browsers are too slow to be useful for the large sequencing datasets being routinely generated by geneticists. Because sequence annotation of variant sites is required before functional characterization can proceed, the lack of a high-throughput pipeline to efficiently annotate variant sites can act as a significant bottleneck in genetics research. RESULTS: SeqAnt (Sequence Annotator) is an open source web service and software package that rapidly annotates DNA sequence variants and identifies recessive or compound heterozygous loci in human, mouse, fly, and worm genome sequencing experiments. Variants are characterized with respect to their functional type, frequency, and evolutionary conservation. Annotated variants can be viewed on a web browser, downloaded in a tab-delimited text file, or directly uploaded in a BED format to the UCSC genome browser. To demonstrate the speed of SeqAnt, we annotated a series of publicly available datasets that ranged in size from 37 to 3,439,107 variant sites. The total time to completely annotate these data completely ranged from 0.17 seconds to 28 minutes 49.8 seconds. CONCLUSION: SeqAnt is an open source web service and software package that overcomes a critical bottleneck facing research and clinical geneticists using second-generation sequencing platforms. SeqAnt will prove especially useful for those investigators who lack dedicated bioinformatics personnel or infrastructure in their laboratories.
Year
DOI
Venue
2010
10.1186/1471-2105-11-471
BMC Bioinformatics
Keywords
Field
DocType
algorithms,dna sequence,microarrays,evolutionary conservation,high throughput,web service,genetic variation,genetics,genome sequence,internet,bioinformatics,genomics
Bottleneck,World Wide Web,Annotation,Biology,Genomics,DNA sequencing,Bioinformatics,Web service,Genetics,Molecular Sequence Annotation,DNA microarray,The Internet
Journal
Volume
Issue
ISSN
11
1
1471-2105
Citations 
PageRank 
References 
17
0.72
4
Authors
9