Title
Identification of mixups among DNA sequencing plates.
Abstract
Motivation: During the process of high-throughput genome sequencing there are opportunities for mixups of reagents and data associated with particular projects. The sequencing templates or sequence data generated for an assembly may become contaminated with reagents or sequences from another project, resulting in poorer quality and inaccurate assemblies. Results: We have developed a system to assess sequence assemblies and monitor for laboratory mixups. We describe several methods for testing the consistency of assemblies and resolving mixed ones. We use statistical tests to evaluate the distribution of sequencing reads from different plates into contigs, and a graph-based approach to resolve situations where data has been inappropriately combined. While these methods have been designed for use in a high-throughput DNA sequencing environment processing thousands of clones, they can be applied in any situation where distinct sequencing projects are performed at redundant coverage.
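The abstract does not say which statistical tests or which graph construction the authors use. As a rough illustration only, the sketch below assumes each read is recorded as a (plate, contig) pair. It computes Pearson's chi-square statistic for the plate-by-contig table, where a large value suggests that some plates feed different contigs than others. It then groups plates that share contigs by taking connected components of a plate-contig graph. The function names, the input format, and the `min_reads` threshold are hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict


def chi_square_statistic(reads):
    """Pearson chi-square statistic for independence of plate and contig.

    reads: iterable of (plate, contig) pairs, one per sequencing read
    (an assumed input format). Returns (statistic, degrees_of_freedom).
    """
    reads = list(reads)
    observed = Counter(reads)
    plate_totals = Counter(plate for plate, _ in reads)
    contig_totals = Counter(contig for _, contig in reads)
    n = len(reads)
    stat = 0.0
    for plate, plate_total in plate_totals.items():
        for contig, contig_total in contig_totals.items():
            # Expected count if reads from every plate spread evenly over contigs.
            expected = plate_total * contig_total / n
            stat += (observed[(plate, contig)] - expected) ** 2 / expected
    dof = (len(plate_totals) - 1) * (len(contig_totals) - 1)
    return stat, dof


def plate_groups(reads, min_reads=1):
    """Group plates linked through shared contigs.

    Builds a bipartite plate-contig graph with an edge wherever a plate
    contributes at least min_reads reads to a contig (hypothetical threshold),
    then returns the plate sets of its connected components. Plates with no
    qualifying edge are omitted.
    """
    adjacency = defaultdict(set)
    for (plate, contig), count in Counter(reads).items():
        if count >= min_reads:
            adjacency[("plate", plate)].add(("contig", contig))
            adjacency[("contig", contig)].add(("plate", plate))
    seen = set()
    groups = []
    for start in list(adjacency):
        if start[0] != "plate" or start in seen:
            continue
        # Depth-first traversal of one connected component.
        stack = [start]
        seen.add(start)
        component = set()
        while stack:
            kind, name = stack.pop()
            if kind == "plate":
                component.add(name)
            for neighbor in adjacency[(kind, name)]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    stack.append(neighbor)
        groups.append(component)
    return groups


# Toy data: plates P1 and P2 feed contig c1, plates P3 and P4 feed contig c2.
mixed = ([("P1", "c1")] * 10 + [("P2", "c1")] * 10
         + [("P3", "c2")] * 10 + [("P4", "c2")] * 10)
stat, dof = chi_square_statistic(mixed)
groups = plate_groups(mixed)
```

On this toy input the plates fall into two groups, which is the pattern one would expect if two projects had been combined into one assembly. In practice the statistic would be compared against a chi-square distribution with `dof` degrees of freedom, and sparse tables would need a test better suited to small expected counts.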
Year
2002
DOI
10.1093/bioinformatics/18.11.1418
Venue
BIOINFORMATICS
Keywords
statistical test, genome sequence, DNA sequence, high throughput
Field
2 base encoding, Data mining, Shotgun sequencing, Hybrid genome assembly, DNA sequencing theory, Alignment-free sequence analysis, Computer science, Contig, DNA sequencing, Bioinformatics, Sequence assembly
DocType
Journal
Volume
18
Issue
11
ISSN
1367-4803
Citations
2
PageRank
0.53
References
3
Authors
5
Name               Order  Citations  PageRank
Nikola Stojanovic  1      39         8.53
Jean L Chang       2      2          0.53
Jessica Lehoczky   3      6          1.12
Michael C. Zody    4      32         3.90
Ken Dewar          5      2          1.55