Title
A novel and well-defined benchmarking method for second generation read mapping.
Abstract
Second generation sequencing technologies yield DNA sequence data at ultra high-throughput. Common to most biological applications is a mapping of the reads to an almost identical or highly similar reference genome. The assessment of the quality of read mapping results is not straightforward and has not been formalized so far. Hence, it has not been easy to compare different read mapping approaches in a unified way and to determine which program is the best for what task.We present a new benchmark method, called Rabema (Read Alignment BEnchMArk), for read mappers. It consists of a strict definition of the read mapping problem and of tools to evaluate the result of arbitrary read mappers supporting the SAM output format.We show the usefulness of the benchmark program by performing a comparison of popular read mappers. The tools supporting the benchmark are licensed under the GPL and available from http://www.seqan.de/projects/rabema.html.
Year
DOI
Venue
2011
10.1186/1471-2105-12-210
BMC Bioinformatics
Keywords
Field
DocType
molecular biology,high throughput,computer science,algorithms,dna sequence
Edit distance,Well-defined,DNA sequencing theory,Computer science,DNA sequencing,Computational biology,Bioinformatics,Genetics,DNA microarray,Benchmarking,Reference genome
Journal
Volume
Issue
ISSN
12
1
1471-2105
Citations 
PageRank 
References 
27
0.99
7
Authors
4
Name
Order
Citations
PageRank
Manuel Holtgrewe1794.05
Anne-katrin Emde21016.06
David Weese325217.79
Knut Reinert41020105.87