An Efficient Algorithm for Mapping of Reads to a Genome Graph Using an Index Based on Hash Tables and Dynamic Programming


Cite item

Full Text

Open Access Open Access
Restricted Access Access granted
Restricted Access Subscription Access

Abstract

The problem of storage of the sequences of a number of closely related genomes and analysis of genome variations is considered. A genome graph with the structure of an acyclic directed graph is used to store matching sections of sequences and known variants. An algorithm for rapid mapping of reads to the genome graph is developed to align the individual nucleotide sequence fragments to the genome graph. The algorithm combines rapid searching using hash tables with the algorithm of dynamic programming and solves the problem of exponential growth in the number of paths on the graph. The implementation of the genome graph and the algorithm of the alignment of reads is developed. A comparison with the best-known programs with similar functionality is made.

About the authors

S. N. Petrov

Vavilov Institute of General Genetics; Moscow Institute of Physics and Technology (State University), Institutskiy per. 9

Email: vsevolod.makeev@gmail.com
Russian Federation, ul. Gubkina 3, Moscow, 119991; Dolgoprudny, Moscow oblast, 141701

L. A. Uroshlev

Vavilov Institute of General Genetics; Moscow Institute of Physics and Technology (State University), Institutskiy per. 9

Email: vsevolod.makeev@gmail.com
Russian Federation, ul. Gubkina 3, Moscow, 119991; Dolgoprudny, Moscow oblast, 141701

A. S. Kasyanov

Vavilov Institute of General Genetics; Moscow Institute of Physics and Technology (State University), Institutskiy per. 9

Email: vsevolod.makeev@gmail.com
Russian Federation, ul. Gubkina 3, Moscow, 119991; Dolgoprudny, Moscow oblast, 141701

V. Yu. Makeev

Vavilov Institute of General Genetics; Moscow Institute of Physics and Technology (State University), Institutskiy per. 9; Engelhardt Institute of Molecular Biology; Research Institute for Genetics and Selection of Industrial Microorganisms

Author for correspondence.
Email: vsevolod.makeev@gmail.com
Russian Federation, ul. Gubkina 3, Moscow, 119991; Dolgoprudny, Moscow oblast, 141701; ul. Vavilova 32, Moscow, 119991; Pervyj Dorozhniy proezd 1, Moscow, 117545

Supplementary files

Supplementary Files
Action
1. JATS XML

Copyright (c) 2018 Pleiades Publishing, Inc.