Title
Improving Metagenomic Classification using Discriminative k-mers from Sequencing Data
Abstract
The major problem when analyzing a metagenomic sample is to taxonomically annotate its reads in order to identify the species they contain. Most of the methods currently available focus on the classification of reads using a set of reference genomes and their k-mers. While in terms of precision these methods have reached percentages of correctness close to perfection, in terms of recall (the actual number of classified reads) the performances fall at around 50%. One of the reasons is the fact that the sequences in a sample can be very different from the corresponding reference genome, e.g. viral genomes are highly mutated. To address this issue, in this paper we study the problem of metagenomic reads classification by improving the reference k-mers library with novel discriminative k-mers from the input sequencing reads. We evaluated the performance in different conditions against several other tools and the results showed an improved F-measure, especially when close reference genomes are not available.
Year
DOI
Venue
2020
10.1007/978-3-030-57821-3_7
ISBRA
Keywords
DocType
Citations 
Metagenomic Reads Classification,Discriminative k-mers,Minimizers
Conference
0
PageRank 
References 
Authors
0.34
0
2
Name
Order
Citations
PageRank
D. Storato100.34
Matteo Comin219120.94