Title
Hopper: software for automating data tracking and flow in DNA sequencing.
Abstract
Motivation: Genome-scale DNA sequencing is a multistep process in which large numbers of small template clones are propagated, purified, sequenced and analyzed on acrylamide gels. A significant challenge for these projects is the scale at which the data handling must be done. Hence, large-scale sequencing facilities will benefit from tracking template DNA information (purification methods, reaction and electrophoresis conditions) in a systematic fashion. The lack of software tools that support automated sample entry and automatic data storage, retrieval and analysis is a major hindrance to recording and using laboratory workflow information to monitor the overall quality of data production. Results: The UNIX file system has been used to prototype automation of the flow of data from the ABI sequencer to a data repository. Data are automatically processed by a central Perl program, Hopper, which runs a series of programs that analyze data quality (read length estimate, fraction of indeterminate bases, and number of contaminating and repetitive sequences), assemble shotgun sequence data, and generate simple reports describing the results.
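One of the quality measures named in the abstract, the fraction of indeterminate bases, can be illustrated with a minimal sketch. This is not Hopper's code (Hopper is a Perl program and its implementation is not shown here). The function names, the rule that any character other than A, C, G or T counts as indeterminate, and the 5% cutoff are all assumptions made for illustration.

```python
def indeterminate_fraction(read: str) -> float:
    """Return the fraction of bases in `read` that are not A, C, G or T.

    Assumption: any other character (for example N) is an indeterminate call.
    """
    bases = read.strip().upper()
    if not bases:
        return 0.0  # an empty read has no indeterminate bases
    ambiguous = sum(1 for base in bases if base not in "ACGT")
    return ambiguous / len(bases)


def passes_quality(read: str, max_fraction: float = 0.05) -> bool:
    """Hypothetical filter: accept reads at or below `max_fraction`.

    The 0.05 default is an illustrative value, not one taken from the paper.
    """
    return indeterminate_fraction(read) <= max_fraction
```

A pipeline step of this kind could be run on each read before assembly, with the resulting fraction written to the per-sample report.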
Year: 1997
DOI: 10.1093/bioinformatics/13.2.175
Venue: COMPUTER APPLICATIONS IN THE BIOSCIENCES
Keywords: dna sequence, data quality, data handling, data storage
Field: Data mining, Software design, Data quality, Computer science, Automation, Software, Information repository, Bioinformatics, Workflow, Group method of data handling, Perl, Database
DocType:
Journal:
Volume: 13
Issue: 2
ISSN: 0266-7061
Citations: 6
PageRank: 8.23
References: 0
Authors: 3
Name          Order   Citations   PageRank
Todd Smith    1       6           8.23
C Abajian     2       10          9.65
Leroy Hood    3       165         45.56