Title
Optimal pooling for genome re-sequencing with ultra-high-throughput short-read technologies.
Abstract
New generation sequencing technologies offer unique opportunities and challenges for re-sequencing studies. In this article, we focus on re-sequencing experiments using the Solexa technology, based on bacterial artificial chromosome (BAC) clones, and address an experimental design problem. In these specific experiments, approximate coordinates of the BACs on a reference genome are known, and fine-scale differences between the BAC sequences and the reference are of interest. The high-throughput characteristics of the sequencing technology makes it possible to multiplex BAC sequencing experiments by pooling BACs for a cost-effective operation. However, the way BACs are pooled in such re-sequencing experiments has an effect on the downstream analysis of the generated data, mostly due to subsequences common to multiple BACs. The experimental design strategy we develop in this article offers combinatorial solutions based on approximation algorithms for the well-known max n-cut problem and the related max n-section problem on hypergraphs. Our algorithms, when applied to a number of sample cases give more than a 2-fold performance improvement over random partitioning.
Year
DOI
Venue
2008
10.1093/bioinformatics/btn173
ISMB
Keywords
Field
DocType
solexa technology,well-known max n-cut problem,re-sequencing study,genome re-sequencing,multiple bacs,ultra-high-throughput short-read technology,re-sequencing experiment,bac sequence,bac sequencing experiment,sequencing technology,related max n-section problem,experimental design problem,sequence alignment,cost effectiveness,algorithms,high throughput,experimental design,bacterial artificial chromosome
Bacterial artificial chromosome,Genome,Approximation algorithm,Data mining,Computer science,Pooling,Constraint graph,Multiplex,Bioinformatics,Reference genome,Performance improvement
Conference
Volume
Issue
ISSN
24
13
1367-4811
Citations 
PageRank 
References 
8
1.40
8
Authors
4
Name
Order
Citations
PageRank
Iman Hajirasouliha120717.62
Fereydoun Hormozdiari227823.16
S Cenk Sahinalp339334.96
Inanc Birol4789.34