An Efficient Algorithm for Mapping of Reads to a Genome Graph Using an Index Based on Hash Tables and Dynamic Programming
- Authors: Petrov S.N.1,2, Uroshlev L.A.1,2, Kasyanov A.S.1,2, Makeev V.Y.1,2,3,4
- Affiliations:
- Vavilov Institute of General Genetics
- Moscow Institute of Physics and Technology (State University), Institutskiy per. 9
- Engelhardt Institute of Molecular Biology
- Research Institute for Genetics and Selection of Industrial Microorganisms
- Issue: Vol 63, No 3 (2018)
- Pages: 311-317
- Section: Molecular Biophysics
- URL: https://journals.rcsi.science/0006-3509/article/view/152606
- DOI: https://doi.org/10.1134/S0006350918030193
- ID: 152606
Abstract
This paper addresses the storage of sequences from multiple closely related genomes and the analysis of variation among them. A genome graph, structured as a directed acyclic graph, stores both the sequence segments shared by the genomes and the known variants. To align individual nucleotide sequence fragments (reads) to this graph, an algorithm for rapid read mapping was developed. The algorithm combines fast hash-table lookup with dynamic programming and avoids the exponential growth in the number of paths through the graph. Both the genome graph and the read alignment algorithm were implemented, and the implementation is compared with the best-known programs of similar functionality.
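The two ingredients named in the abstract, a hash-table index and dynamic programming over a directed acyclic graph, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy graph (`BASES`, `SUCCS`), the one-base-per-node layout, the unit edit costs, and the function names `build_kmer_index`, `find_seeds`, and `align_to_graph` are all assumptions made for illustration. The dynamic programming step visits each node once in topological order and merges the columns of its predecessors, so its cost grows with the number of nodes and edges rather than with the number of paths.

```python
from collections import defaultdict
from graphlib import TopologicalSorter  # Python 3.9+

# Hypothetical toy graph: one base per node, edges point downstream.
# It encodes two haplotypes, ACGT and AGGT (a single-nucleotide variant).
BASES = {0: "A", 1: "C", 2: "G", 3: "G", 4: "T"}
SUCCS = {0: [1, 2], 1: [3], 2: [3], 3: [4], 4: []}


def build_kmer_index(bases, succs, k):
    """Hash table: k-mer -> set of nodes where a path spelling it starts."""
    index = defaultdict(set)

    def extend(start, node, prefix):
        prefix += bases[node]
        if len(prefix) == k:
            index[prefix].add(start)
            return
        for nxt in succs[node]:
            extend(start, nxt, prefix)

    for node in bases:
        extend(node, node, "")
    return index


def find_seeds(index, read, k):
    """Return sorted (read offset, graph node) pairs with an exact k-mer hit."""
    return sorted(
        (i, node)
        for i in range(len(read) - k + 1)
        for node in index.get(read[i:i + k], ())
    )


def align_to_graph(bases, succs, read):
    """Minimum edit distance between the read and any path in the graph.

    The path may start and end at any node. Each node gets one DP column,
    computed from the columns of its predecessors in topological order.
    """
    preds = {v: [] for v in bases}
    for u, outs in succs.items():
        for v in outs:
            preds[v].append(u)

    m = len(read)
    start = list(range(m + 1))  # virtual source: read prefix left unaligned
    cols = {}
    best = m
    for v in TopologicalSorter(preds).static_order():
        sources = [start] + [cols[p] for p in preds[v]]
        col = [0] * (m + 1)
        col[0] = min(s[0] for s in sources) + 1  # node base with no read base
        for j in range(1, m + 1):
            sub = int(read[j - 1] != bases[v])
            # Match/mismatch or skipping this node's base, over all sources.
            col[j] = min(min(s[j - 1] + sub, s[j] + 1) for s in sources)
            # Extra read base that has no counterpart in the graph.
            col[j] = min(col[j], col[j - 1] + 1)
        cols[v] = col
        best = min(best, col[m])
    return best


index = build_kmer_index(BASES, SUCCS, k=2)
seeds = find_seeds(index, "AGGT", k=2)
distance = align_to_graph(BASES, SUCCS, "ATGT")
```

In this sketch the seed lookup and the alignment run independently over the whole graph. A practical mapper would presumably use the seed hits to restrict the dynamic programming to the neighborhood of candidate positions, but how the paper couples the two steps is not stated in the abstract.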
About the authors
S. N. Petrov
Vavilov Institute of General Genetics; Moscow Institute of Physics and Technology (State University), Institutskiy per. 9
Email: vsevolod.makeev@gmail.com
Russian Federation, ul. Gubkina 3, Moscow, 119991; Dolgoprudny, Moscow oblast, 141701
L. A. Uroshlev
Vavilov Institute of General Genetics; Moscow Institute of Physics and Technology (State University), Institutskiy per. 9
Email: vsevolod.makeev@gmail.com
Russian Federation, ul. Gubkina 3, Moscow, 119991; Dolgoprudny, Moscow oblast, 141701
A. S. Kasyanov
Vavilov Institute of General Genetics; Moscow Institute of Physics and Technology (State University), Institutskiy per. 9
Email: vsevolod.makeev@gmail.com
Russian Federation, ul. Gubkina 3, Moscow, 119991; Dolgoprudny, Moscow oblast, 141701
V. Yu. Makeev
Vavilov Institute of General Genetics; Moscow Institute of Physics and Technology (State University), Institutskiy per. 9; Engelhardt Institute of Molecular Biology; Research Institute for Genetics and Selection of Industrial Microorganisms
Author for correspondence.
Email: vsevolod.makeev@gmail.com
Russian Federation, ul. Gubkina 3, Moscow, 119991; Dolgoprudny, Moscow oblast, 141701; ul. Vavilova 32, Moscow, 119991; Pervyj Dorozhniy proezd 1, Moscow, 117545
