Title
Sequoya: Multi-objective multiple sequence alignment in Python.
Abstract
Motivation: Multiple sequence alignment (MSA) consists of finding the optimal alignment of three or more biological sequences to identify highly conserved regions that may be the result of similarities and relationships between the sequences. MSA is an optimization problem with NP-hard complexity (non-deterministic polynomial-time hardness), because the time needed to find optimal alignments raises exponentially along with the number of sequences and their length. Furthermore, the problem becomes multiobjective when more than one score is considered to assess the quality of an alignment, such as maximizing the percentage of totally conserved columns and minimizing the number of gaps. Our motivation is to provide a Python tool for solving MSA problems using evolutionary algorithms, a nonexact stochastic optimization approach that has proven to be effective to solve multiobjective problems. Results: The software tool we have developed, called Sequoya, is written in the Python programming language, which offers a broad set of libraries for data analysis, visualization and parallelism. Thus, Sequoya offers a graphical tool to visualize the progress of the optimization in real time, the ability to guide the search toward a preferred region in run-time, parallel support to distribute the computation among nodes in a distributed computing system, and a graphical component to assist in the analysis of the solutions found at the end of the optimization.
Year
DOI
Venue
2020
10.1093/bioinformatics/btaa257
BIOINFORMATICS
DocType
Volume
Issue
Journal
36
12
ISSN
Citations 
PageRank 
1367-4803
0
0.34
References 
Authors
0
3
Name
Order
Citations
PageRank
Antonio Benítez-Hidalgo111.71
Antonio J. Nebro2111854.62
José F Aldana Montes300.34