Title
RLALIGN: A Reinforcement Learning Approach for Multiple Sequence Alignment
Abstract
Multiple sequence alignment (MSA) is one of the best studied problems in bioinformatics because of the broad set of genomics, proteomics, and evolutionary analyses that rely on it. Yet the problem is NP-hard and existing heuristics are imperfect. Reinforcement learning (RL) techniques have emerged recently as a potential solution to a wide diversity of computational problems, but have yet to be applied to MSA. In this paper, we describe RLALIGN, a method to solve the MSA problem using RL. RLALIGN is based on Asynchronous Advantage Actor Critic (A3C), a cutting-edge RL framework. Due to the absence of a goal state, however, it required several important modifications. RLALIGN can be trained to accurately align moderate-length sequences, and various heuristics allow it to scale to longer sequences. The accuracy of the alignments produced is on par with, and often better than those of well established alignment algorithms. Overall, our work demonstrates the potential of RL approaches for complex combinatorial problems such as MSA. RLALIGN will prove useful for realignment tasks, where portions of a larger alignment need to be optimized. Unlike classical algorithms, RLALIGN is incognizant to the nature of the scoring scheme, leading to easy generalization to a variety of problem variants.
Year
DOI
Venue
2018
10.1109/BIBE.2018.00019
2018 IEEE 18th International Conference on Bioinformatics and Bioengineering (BIBE)
Keywords
Field
DocType
bioinformatics,machine learning,reinforcement learning,multiple sequence alignment
Asynchronous communication,Computational problem,Imperfect,Computer science,Genomics,Heuristics,Artificial intelligence,Multiple sequence alignment,Machine learning,Reinforcement learning
Conference
ISSN
ISBN
Citations 
2159-5410
978-1-5386-5043-1
0
PageRank 
References 
Authors
0.34
0
3
Name
Order
Citations
PageRank
Ramchalam Kinattinkara Ramakrishnan100.68
Jaspal Singh200.68
Mathieu Blanchette363162.65