Joeres, RomanGesellschaft für Informatik2021-12-152021-12-152021978-3-88579-751-7https://dl.gi.de/handle/20.500.12116/37785Multiple sequence alignment (MSA) is one of the primal problems in biology and bioinformatics. The question of how to align multiple sequences correctly is crucial for many other fields of research, e.g., gaining information about the evolutionary distance of two or more sequences and therefore about their corresponding species, finding protein targets for drugs, or finding a drug for a certain target protein. Reinforcement learning (RL), and especially deep reinforcement learning (DRL), has become popular in recent years. To name just a few, DRL has shown major success in complex games such as Atari Games, Chess, and Go. We model the problem of aligning multiple sequences as a Markov decision process (MDP) and examine the performance of different (D)RL algorithms compared to state-of-the-art tools.enBioinformaticsMultiple Sequence AlignmentReinforcement LearningDeep Reinforcement LearningMultiple Sequence Alignment using Deep Reinforcement Learning1614-3213