Title
High Performance Computing for Haplotyping - Models and Platforms.
Abstract
The reconstruction of the haplotype pair for each chromosome is a hot topic in Bioinformatics and Genome Analysis. In Haplotype Assembly (HA), all heterozygous Single Nucleotide Polymorphisms (SNPs) have to be assigned to exactly one of the two chromosomes. In this work, we outline the state-of-the-art on HA approaches and present an in-depth analysis of the computational performance of GenHap, a recent method based on Genetic Algorithms. GenHap was designed to tackle the computational complexity of the HA problem by means of a divide-et-impera strategy that effectively leverages multi-core architectures. In order to evaluate GenHap's performance, we generated different instances of synthetic (yet realistic) data exploiting empirical error models of four different sequencing platforms (namely, Illumina NovaSeq, Roche/454, PacBio RS II and Oxford Nanopore Technologies MinION). Our results show that the processing time generally decreases along with the read length, involving a lower number of sub-problems to be distributed on multiple cores.
Year
DOI
Venue
2018
10.1007/978-3-030-10549-5_51
Lecture Notes in Computer Science
Keywords
Field
DocType
Future-generation sequencing,Genome Analysis Haplotype Assembly,High Performance Computing,Master-Slave paradigm
Genome,Minion,Supercomputer,Nanopore sequencing,Computer science,Parallel computing,Haplotype,Genetic algorithm,Computational complexity theory
Conference
Volume
ISSN
Citations 
11339
0302-9743
0
PageRank 
References 
Authors
0.34
15
9
Name
Order
Citations
PageRank
Andrea Tangherloni1407.88
Leonardo Rundo2256.40
Simone Spolaor394.97
Marco S. Nobile414323.69
Ivan Merelli529435.36
Daniela Besozzi639139.10
Giancarlo Mauri72106297.38
Paolo Cazzaniga823527.16
Pietro Liò955099.98