Title
Estimates of allele-specific expression in Drosophila with a single genome sequence and RNA-seq data.
Abstract
Motivation: Genetic variation in cis-regulatory elements is an important cause of variation in gene expression. Cis-regulatory variation can be detected by using high-throughput RNA sequencing (RNA-seq) to identify differences in the expression of the two alleles of a gene. This requires that reads from the two alleles are equally likely to map to a reference genome(s), and that single-nucleotide polymorphisms (SNPs) are accurately called, so that reads derived from the different alleles can be identified. Both of these prerequisites can be achieved by sequencing the genomes of the parents of the individual being studied, but this is often prohibitively costly.
Results: In Drosophila, we demonstrate that biases during read mapping can be avoided by mapping reads to two alternative genomes that incorporate SNPs called from the RNA-seq data. The SNPs can be reliably called from the RNA-seq data itself, provided any variants not found in high-quality SNP databases are filtered out. Finally, we suggest a way of measuring allele-specific expression (ASE) by crossing the line of interest to a reference line with a high-quality genome sequence. Combined with our bioinformatic methods, this approach minimizes mapping biases, allows poor-quality data to be identified and removed, and aids in the biological interpretation of the data, as the parent of origin of each allele is known. In conclusion, our results suggest that accurate estimates of ASE do not require the parental genomes of the individual being studied to be sequenced.
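The two computational ideas in the abstract (keeping only SNPs that also appear in a trusted database, then comparing read counts for the two alleles) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the `(chromosome, position, alternate base)` tuple layout, and the use of an exact binomial test against an expected 50:50 ratio are all assumptions made for illustration, since the abstract does not state which statistical test was used.

```python
from math import comb


def filter_known_snps(called, known):
    """Keep only called SNPs that also appear in a trusted SNP set.

    Both arguments hold hypothetical (chromosome, position, alt_base) tuples.
    """
    return [snp for snp in called if snp in known]


def allelic_ratio(ref_reads, alt_reads):
    """Fraction of allele-informative reads that carry the reference allele."""
    total = ref_reads + alt_reads
    if total == 0:
        raise ValueError("no allele-informative reads at this SNP")
    return ref_reads / total


def binomial_two_sided_p(k, n, p=0.5):
    """Exact two-sided binomial p-value (assumed test, not from the paper).

    Sums the probabilities of all outcomes that are no more likely than
    observing k reference reads out of n, under success probability p.
    """
    probs = [comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(n + 1)]
    observed = probs[k]
    # Small relative tolerance guards against floating-point ties.
    return min(1.0, sum(q for q in probs if q <= observed * (1 + 1e-9)))


# Hypothetical SNP calls from RNA-seq and a hypothetical trusted database.
called = [("2L", 100, "A"), ("2L", 200, "T")]
known = {("2L", 100, "A")}
kept = filter_known_snps(called, known)

# Hypothetical read counts at the retained SNP.
ratio = allelic_ratio(30, 10)
p_value = binomial_two_sided_p(30, 40)
```

A ratio far from 0.5 with a small p-value would point to allelic imbalance at that SNP under these assumptions. A real analysis would also need to account for overdispersion and multiple testing, which this sketch ignores.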
Year
2014
DOI
10.1093/bioinformatics/btu342
Venue
BIOINFORMATICS
Field
Genome, Gene, Allele, RNA-Seq, Biology, Genetic variation, Whole genome sequencing, Single-nucleotide polymorphism, Bioinformatics, Genetics, Reference genome
DocType
Journal
Volume
30
Issue
18
ISSN
1367-4803
Citations
2
PageRank
0.43
References
6
Authors
3
Name, Order, Citations, PageRank
Andrew Quinn, 1, 2, 0.43
Punita Juneja, 2, 2, 0.43
Francis M Jiggins, 3, 2, 0.43