Title
SCGid, a consensus approach to contig filtering and genome prediction from single cell sequencing libraries of uncultured eukaryotes.
Abstract
Motivation: Whole-genome sequencing of uncultured eukaryotic genomes is complicated by difficulties in acquiring sufficient amounts of tissue. Single-cell genomics (SCG) by multiple displacement amplification provides a technical workaround, yielding whole-genome libraries which can be assembled de novo. Downsides of multiple displacement amplification include coverage biases and exacerbation of contamination. These factors affect assembly continuity and fidelity, complicating discrimination of genomes from contamination and noise by available tools. Uncultured eukaryotes and their relatives are often underrepresented in large sequence data repositories, further impairing identification and separation. Results: We compare the ability of filtering approaches to remove contamination and resolve eukaryotic draft genomes from SCG metagenomes, finding significant variation in outcomes. To address these inconsistencies, we introduce a consensus approach that is codified in the SCGid software package. SCGid parallelly filters assemblies using different approaches, yielding three intermediate drafts from which consensus is drawn. Using genuine and mock SCG metagenomes, we show that our approach corrects for variation among draft genomes predicted by individual approaches and outperforms them in recapitulating published drafts in a fast and repeatable way, providing a useful alternative to available methods and manual curation.
Year
DOI
Venue
2020
10.1093/bioinformatics/btz866
BIOINFORMATICS
DocType
Volume
Issue
Journal
36
7
ISSN
Citations 
PageRank 
1367-4803
0
0.34
References 
Authors
0
3
Name
Order
Citations
PageRank
Kevin R Amses100.34
William J Davis200.34
Timothy Y James300.34