Title
Minimizing Genomic Duplication Episodes
Abstract
Background: The genomic duplication study is fundamental to understand the process of evolution. In evolutionary molecular biology, many approaches focus on discovering the occurrences of gene duplications and multiple gene duplication episodes and their locations in the Tree of Life. To reconstruct such episodes, one can cluster single gene duplications inferred by reconciling a set of gene trees with a species tree.Results: We propose an efficient quadratic time algorithm to solve the problem of genomic duplication clustering, in which input gene trees are rooted, episode locations are restricted to preserve the minimal number of single gene duplications, clustering rules are described by minimum episodes method, and the goal is based on the recently introduced new approach to minimize the maximal number of duplication episodes on a single path, called here the MP score. Based on our theoretical results, we show new algorithmic relationships between the MP score and the minimum episodes (ME) score, defined as the minimal number of duplication episodes.Conclusions: Our evaluation analysis on three empirical datasets demonstrates, that under the model in which the minimal number of duplications is preserved, the duplication clusterings with minimal MP score support the clusterings with the minimal total number of duplication episodes.
Year
DOI
Venue
2020
10.1016/j.compbiolchem.2020.107260
COMPUTATIONAL BIOLOGY AND CHEMISTRY
Keywords
DocType
Volume
Genomic duplication, Duplication episode, Minimum episodes problem, Reconciliation, Maximal path, Species tree
Journal
89
ISSN
Citations 
PageRank 
1476-9271
0
0.34
References 
Authors
0
3
Name
Order
Citations
PageRank
Jaroslaw Paszek173.23
Jerzy Tiuryn21210126.00
Pawel Górecki311214.26