Title
Minirmd: Accurate And Fast Duplicate Removal Tool For Short Reads Via Multiple Minimizers
Abstract
A Summary: Removing duplicate and near-duplicate reads, generated by high-throughput sequencing technologies, is able to reduce computational resources in downstream applications. Here we develop minirmd, a de novo tool to remove duplicate reads via multiple rounds of clustering using different length of minimizer. Experiments demonstrate that minirmd removes more near-duplicate reads than existing clustering approaches and is faster than existing multi-core tools. To the best of our knowledge, minirmd is the first tool to remove near-duplicates on reverse-complementary strand.
Year
DOI
Venue
2021
10.1093/bioinformatics/btaa915
BIOINFORMATICS
DocType
Volume
Issue
Journal
37
11
ISSN
Citations 
PageRank 
1367-4803
1
0.35
References 
Authors
0
4
Name
Order
Citations
PageRank
Yuansheng Liu131.73
Xiaocai Zhang210.35
quan zou355867.61
Xiangxiang Zeng458950.79