Title
Enriching targeted sequencing experiments for rare disease alleles
Abstract
Motivation: Next-generation targeted resequencing of genome-wide association study (GWAS)-associated genomic regions is a common approach for follow-up of indirect association of common alleles. However, it is prohibitively expensive to sequence all the samples from a well-powered GWAS study with sufficient depth of coverage to accurately call rare genotypes. As a result, many studies may use next-generation sequencing for single nucleotide polymorphism (SNP) discovery in a smaller number of samples, with the intent to genotype candidate SNPs with rare alleles captured by resequencing. This approach is reasonable, but may be inefficient for rare alleles if samples are not carefully selected for the resequencing experiment. Results: We have developed a probability-based approach, SampleSeq, to select samples for a targeted resequencing experiment that increases the yield of rare disease alleles substantially over random sampling of cases or controls or sampling based on genotypes at associated SNPs from GWAS data. This technique allows for smaller sample sizes for resequencing experiments, or allows the capture of rarer risk alleles. When following up multiple regions, SampleSeq selects subjects with an even representation of all the regions. SampleSeq also can be used to calculate the sample size needed for the resequencing to increase the chance of successful capture of rare alleles of desired frequencies. Software: http://biostat.mc.vanderbilt.edu/SampleSeq Contact: chun.li@vanderbilt.edu Supplementary information:Supplementary data are available at Bioinformatics online.
Year
DOI
Venue
2011
10.1093/bioinformatics/btr324
Bioinformatics
Keywords
Field
DocType
genotype,probability,genome wide association study,alleles,computational biology,computer simulation,genome,algorithms
Genome,Genotype,Rare disease,Allele,Biology,Genome-wide association study,Single-nucleotide polymorphism,Bioinformatics,Genetics,SNP,Sample size determination
Journal
Volume
Issue
ISSN
27
15
1367-4803
Citations 
PageRank 
References 
2
0.80
0
Authors
3
Name
Order
Citations
PageRank
Todd L. Edwards1745.50
Zhuo Song232.10
Chun Li3638.28