Title
Three-stage quality control strategies for DNA re-sequencing data.
Abstract
Advances in next-generation sequencing (NGS) technologies have greatly improved our ability to detect genomic variants for biomedical research. In particular, NGS technologies have been recently applied with great success to the discovery of mutations associated with the growth of various tumours and in rare Mendelian diseases. The advance in NGS technologies has also created significant challenges in bioinformatics. One of the major challenges is quality control of the sequencing data. In this review, we discuss the proper quality control procedures and parameters for Illumina technology-based human DNA re-sequencing at three different stages of sequencing: raw data, alignment and variant calling. Monitoring quality control metrics at each of the three stages of NGS data provides unique and independent evaluations of data quality from differing perspectives. Properly conducting quality control protocols at all three stages and correctly interpreting the quality control results are crucial to ensure a successful and meaningful study.
Year: 2014
DOI: 10.1093/bib/bbt069
Venue: BRIEFINGS IN BIOINFORMATICS
Keywords: sequencing, quality control, FASTQ, alignment, variant calling
Field: Data mining, Data quality, Biology, Raw data, Genomic library, Quality control, Bioinformatics
DocType: Journal
Volume: 15
Issue: 6
ISSN: 1467-5463
Citations: 6
PageRank: 0.92
References: 13
Authors: 5
Name               Order   Citations   PageRank
Yan Guo            1       77          12.73
Fei Ye             2       17          3.44
Quanghu Sheng      3       7           1.26
Travis Clark       4       6           1.26
David C. Samuels   5       18          6.68