A study of neural network Russian language models for automatic continuous speech recognition systems
- Authors: Kipyatkova I.S.1,2, Karpov A.A.1
-
Affiliations:
- St. Petersburg Institute for Informatics and Automation
- State University of Aerospace Instrumentation
- Issue: Vol 78, No 5 (2017)
- Pages: 858-867
- Section: Data Analysis
- URL: https://journals.rcsi.science/0005-1179/article/view/150599
- DOI: https://doi.org/10.1134/S0005117917050083
- ID: 150599
Cite item
Abstract
We show the results of studying models of the Russian language constructed with recurrent artificial neural networks for systems of automatic recognition of continuous speech. We construct neural network models with different number of elements in the hidden layer and perform linear interpolation of neural network models with the baseline trigram language model. The resulting models were used at the stage of rescoring the N best list. In our experiments on the recognition of continuous Russian speech with extra-large vocabulary (150 thousands of word forms), the relative reduction in the word error rate obtained after rescoring the 50 best list with the neural network language models interpolated with the trigram model was 14%.
About the authors
I. S. Kipyatkova
St. Petersburg Institute for Informatics and Automation; State University of Aerospace Instrumentation
Author for correspondence.
Email: kipyatkova@iias.spb.su
Russian Federation, St. Petersburg; St. Petersburg
A. A. Karpov
St. Petersburg Institute for Informatics and Automation
Email: kipyatkova@iias.spb.su
Russian Federation, St. Petersburg
Supplementary files
