Title
Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data.
Abstract
White spruce (Picea glauca) is a dominant conifer of the boreal forests of North America, and providing genomics resources for this commercially valuable tree will help improve forest management and conservation efforts. Sequencing and assembling the large and highly repetitive spruce genome, however, pushes the boundaries of current technology. Here, we describe a whole-genome shotgun sequencing strategy using two Illumina sequencing platforms and an assembly approach using the ABySS software. We report a 20.8 giga base pair (Gb) draft genome in 4.9 million scaffolds, with a scaffold N50 of 20,356 bp. We demonstrate how recent improvements in sequencing technology, especially increasing read lengths and paired-end reads from longer fragments, have a major impact on assembly contiguity. We also note that scalable bioinformatics tools are instrumental in providing rapid draft assemblies. The Picea glauca genome sequencing and assembly data are available through NCBI (Accession: ALWZ0100000000; PID: PRJNA83435) at http://www.ncbi.nlm.nih.gov/bioproject/83435.
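The scaffold N50 quoted in the abstract is a standard contiguity statistic: the length L such that scaffolds of length L or longer together contain at least half of the assembled bases. The following is a minimal sketch of that definition, not the authors' pipeline; the function name `n50` and the toy lengths are illustrative assumptions, and assembly tools may apply a minimum-length cutoff before computing the statistic, which this sketch does not.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (hypothetical helper).

    N50 is the largest length L such that sequences of length >= L
    account for at least half of the total assembled bases.
    """
    lengths = list(lengths)
    if not lengths:
        raise ValueError("lengths must be non-empty")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating bases.
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer comparison avoids floating-point issues with total / 2.
        if 2 * running >= total:
            return length


# Toy example (made-up lengths, not data from the paper).
toy_scaffolds = [2, 3, 4, 5, 6, 7, 8, 9, 10]
print(n50(toy_scaffolds))
```

A higher N50 at a similar total size means more of the genome sits in long scaffolds, which is why it serves as a summary of assembly contiguity.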
Year
2013
DOI
10.1093/bioinformatics/btt178
Venue
Bioinformatics
Keywords
illumina sequencing platform, assembly approach, picea glauca genome sequencing, picea glauca, current technology, assembly contiguity, gb white, assembly data, giga base pairs draft, sequencing technology, whole-genome shotgun, rapid draft assembly, genomics
Field
Genome, Shotgun sequencing, Giga-, Biology, Illumina dye sequencing, Genomics, DNA sequencing, Bioinformatics, Sequence assembly
DocType
Journal
Volume
29
Issue
12
ISSN
1367-4811
Citations
10
PageRank
0.70
References
7
Authors
24