Mapping reads on a genomic sequence: an algorithmic overview and a practical comparative analysis. - Citegraph

Paper Info

Title
Mapping reads on a genomic sequence: an algorithmic overview and a practical comparative analysis.

Abstract
Mapping short reads against a reference genome is classically the first step of many next-generation sequencing data analyses, and it should be as accurate as possible. Because of the large number of reads to handle, numerous sophisticated algorithms have been developped in the last 3 years to tackle this problem. In this article, we first review the underlying algorithms used in most of the existing mapping tools, and then we compare the performance of nine of these tools on a well controled benchmark built for this purpose. We built a set of reads that exist in single or multiple copies in a reference genome and for which there is no mismatch, and a set of reads with three mismatches. We considered as reference genome both the human genome and a concatenation of all complete bacterial genomes. On each dataset, we quantified the capacity of the different tools to retrieve all the occurrences of the reads in the reference genome. Special attention was paid to reads uniquely reported and to reads with multiple hits.

Year	DOI	Venue
2012	10.1089/cmb.2012.0022	JOURNAL OF COMPUTATIONAL BIOLOGY
Keywords	Field	DocType
NGS,benchmarking,short read alignment,Burrows-Wheeler Transform,suffix tree,suffix array,hashing,spaced seeds	Data mining,Hybrid genome assembly,Burrows–Wheeler transform,Genomics,Suffix array,Concatenation,Human genome,Bioinformatics,Bacterial genome size,Mathematics,Reference genome	Journal
Volume	Issue	ISSN
19.0	6	1066-5277
Citations	PageRank	References
16	0.73	13
Authors
6

Authors (6 rows)

Cited by (16 rows)

References (13 rows)

Name	Order	Citations	PageRank
S Schbath	1	303	40.02
Véronique Martin	2	29	3.10
Matthias Zytnicki	3	166	8.92
Julien Fayolle	4	21	1.84
Valentin Loux	5	16	2.09
Jean-françois Gibrat	6	126	6.08

1