Title
A de novo genome assembler based on MapReduce and bi-directed de Bruijn graph
Abstract
The next generation sequencing (NGS) techniques have enabled biologists to generate large DNA sequences in a high-throughput and low-cost way. Assembly of NGS reads still face great challenges due to the short reads and enormous high volume. In this paper, we presented a new assembler, called GAMR, which is based on bi-directed de Bruijn graph and implemented using MapReduce framework. We designed distributed algorithm for each step in GAMR, making it scalable in assembling large-scale genomes. We evaluated GAMR using GAGE's data and compared it against other NGS assemblers. The results showed GAMR assembled contigs and scaffolds with better accuracy and longer N50 values.
Year
DOI
Venue
2016
10.1109/BIBM.2016.7822494
2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
Keywords
Field
DocType
next generation sequencing (NGS),assembly,MapReduce,bi-directed de Bruijn graph
Genome,Computer science,Theoretical computer science,Genomics,Contig,Distributed algorithm,De Bruijn graph,DNA sequencing,Bioinformatics,Scalability
Conference
ISSN
ISBN
Citations 
2156-1125
978-1-5090-1612-9
0
PageRank 
References 
Authors
0.34
5
5
Name
Order
Citations
PageRank
Yuehua Zhang100.34
Pengfei Xuan2163.07
Yunsheng Wang311.04
Pradip K. Srimani487996.11
Feng Luo528426.03