Information-theoretic analysis of efficiency of the phonetic encoding–decoding method in automatic speech recognition
- Authors: Savchenko V. (1), Savchenko A. (2)
- Affiliations:
  1. Nizhny Novgorod State Linguistic University
  2. National Research University Higher School of Economics
- Issue: Volume 61, No. 4 (2016)
- Pages: 430-435
- Section: Theory and Methods of Signal Processing
- URL: https://journals.rcsi.science/1064-2269/article/view/196922
- DOI: https://doi.org/10.1134/S1064226916040112
- ID: 196922
Abstract
A method for the phonetic decoding of words in automatic speech recognition is considered. The properties of the Kullback–Leibler divergence are used to derive an estimate of the distribution of the divergence between minimal speech units (e.g., single phonemes) within a single class. It is demonstrated that the variance of the intraphonemic divergence is minimized when the phonetic database is tuned to the voice of a single speaker. The estimates are confirmed by experimental results on the recognition of vowel sounds and isolated words of the Russian language.
About the authors
V. Savchenko
Nizhny Novgorod State Linguistic University
Email: avsavchenko@hse.ru
Russia, ul. Minina 31a, Nizhny Novgorod, 603155
A. Savchenko
National Research University Higher School of Economics
Corresponding author
Email: avsavchenko@hse.ru
Russia, Bol’shaya Pecherskaya ul. 25/12, Nizhny Novgorod, 603155