Title | ||
---|---|---|
MetaProb 2: Improving Unsupervised Metagenomic Binning with Efficient Reads Assembly Using Minimizers |
Abstract | ||
---|---|---|
Current technologies allow the sequencing of microbial communities directly from the environment without prior culturing. One of the major problems when analyzing a microbial sample is to taxonomically annotate its reads to identify the species it contains. Taxonomic analysis of microbial communities requires reads clustering, a process referred to as binning. The major problems of metagenomic reads binning are the lack of taxonomically related genomes in existing reference databases, the uneven abundance ratio of species, and sequencing errors. In this paper we present MetaProb 2, an unsupervised binning method based on reads assembly and probabilistic k-mers statistics. The novelties of MetaProb 2 are the use of minimizers to efficiently assemble reads into unitigs and a community detection algorithm based on graph modularity to cluster unitigs and to detect representative unitigs. The effectiveness of MetaProb 2 is demonstrated on both simulated and synthetic datasets in comparison with state-of-the-art binning tools such as MetaProb, AbundanceBin, Bimeta and MetaCluster. Available at: https://github.com/frankandreace/metaprob2. |
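
The abstract names minimizers as the device used to assemble reads efficiently but does not give the k-mer ordering or window size that MetaProb 2 uses. As a rough illustration of the general idea only, the sketch below picks the lexicographically smallest k-mer in every window of `w` consecutive k-mers; the function name `minimizers`, the parameters `k` and `w`, and the lexicographic ordering are assumptions made for this example and are not taken from the paper or its repository.

```python
def minimizers(seq, k, w):
    """Return the sorted (position, k-mer) minimizers of seq.

    Illustrative assumption: a window holds w consecutive k-mers and its
    minimizer is the lexicographically smallest one (leftmost on ties).
    """
    if k <= 0 or w <= 0:
        raise ValueError("k and w must be positive")
    # All k-mers of the sequence, indexed by start position.
    kmers = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    selected = set()
    for start in range(len(kmers) - w + 1):
        window = kmers[start:start + w]
        # min() returns the first smallest index, so ties go to the leftmost k-mer.
        offset = min(range(w), key=lambda j: window[j])
        selected.add((start + offset, window[offset]))
    return sorted(selected)


def shared_minimizers(read_a, read_b, k, w):
    """Return the minimizer k-mers that two reads have in common."""
    kmers_a = {kmer for _, kmer in minimizers(read_a, k, w)}
    kmers_b = {kmer for _, kmer in minimizers(read_b, k, w)}
    return kmers_a & kmers_b


if __name__ == "__main__":
    print(minimizers("ACGTACGT", k=3, w=3))
    print(shared_minimizers("ACGTACGT", "GTACGTTT", k=3, w=3))
```

Because adjacent windows usually select the same k-mer, a read is summarized by far fewer minimizers than k-mers, and reads that overlap tend to share some of them, which is what makes minimizers a cheap way to find candidate overlaps before assembly.
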
Year | DOI | Venue |
---|---|---|
2020 | 10.1007/978-3-030-79290-9_2 | COMPUTATIONAL ADVANCES IN BIO AND MEDICAL SCIENCES |
Keywords | DocType | Volume |
---|---|---|
Metagenomic, Reads binning, Reads assembly with minimizers, k-mers statistics | Conference | 12686 |
ISSN | Citations | PageRank |
---|---|---|
0302-9743 | 0 | 0.34 |
References | Authors | |
---|---|---|
0 | 3 | |
Name | Order | Citations | PageRank |
---|---|---|---|
F. Andreace | 1 | 0 | 0.34 |
Cinzia Pizzi | 2 | 139 | 15.73 |
Matteo Comin | 3 | 0 | 0.34 |