Title | ||
---|---|---|
Minirmd: Accurate And Fast Duplicate Removal Tool For Short Reads Via Multiple Minimizers |
Abstract | ||
---|---|---|
A Summary: Removing duplicate and near-duplicate reads, generated by high-throughput sequencing technologies, is able to reduce computational resources in downstream applications. Here we develop minirmd, a de novo tool to remove duplicate reads via multiple rounds of clustering using different length of minimizer. Experiments demonstrate that minirmd removes more near-duplicate reads than existing clustering approaches and is faster than existing multi-core tools. To the best of our knowledge, minirmd is the first tool to remove near-duplicates on reverse-complementary strand. |
Year | DOI | Venue |
---|---|---|
2021 | 10.1093/bioinformatics/btaa915 | BIOINFORMATICS |
DocType | Volume | Issue |
Journal | 37 | 11 |
ISSN | Citations | PageRank |
1367-4803 | 1 | 0.35 |
References | Authors | |
0 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yuansheng Liu | 1 | 3 | 1.73 |
Xiaocai Zhang | 2 | 1 | 0.35 |
quan zou | 3 | 558 | 67.61 |
Xiangxiang Zeng | 4 | 589 | 50.79 |