Information-theoretic analysis of efficiency of the phonetic encoding–decoding method in automatic speech recognition
- Authors: Savchenko V. (1), Savchenko A. (2)
- Affiliations:
  - (1) Nizhny Novgorod State Linguistic University
  - (2) National Research University Higher School of Economics
- Issue: Vol. 61, No. 4 (2016)
- Pages: 430-435
- Section: Theory and Methods of Signal Processing
- URL: https://journals.rcsi.science/1064-2269/article/view/196922
- DOI: https://doi.org/10.1134/S1064226916040112
- ID: 196922
Abstract
A method for the phonetic decoding of words in automatic speech recognition is considered. The properties of the Kullback–Leibler divergence are used to derive an estimate of the distribution of the divergence between minimal speech units (e.g., single phonemes) within a single class. It is shown that the variance of the intraphonemic divergence is minimized when the phonetic database is tuned to the voice of a single speaker. The estimates are supported by experimental results on the recognition of vowel sounds and isolated words of the Russian language.
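For orientation only, the quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' estimator, which this page does not give. It assumes that each realization of a phoneme is summarized by a discrete probability distribution (for example, a normalized spectrum) and measures the spread of the Kullback–Leibler divergence of several realizations from one reference. The function name `kl_divergence` and all numeric values are hypothetical.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q), in nats, for discrete distributions."""
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0.0:
            continue  # the term 0 * log(0 / q) is taken as 0 by convention
        if qi == 0.0:
            return math.inf  # p puts mass where q has none
        total += pi * math.log(pi / qi)
    return total


# Hypothetical reference distribution for one phoneme class.
reference = [0.5, 0.3, 0.2]

# Hypothetical realizations of the same phoneme (made-up values).
realizations = [
    [0.5, 0.3, 0.2],
    [0.45, 0.35, 0.2],
    [0.6, 0.25, 0.15],
]

# Intra-class divergences and their sample mean and variance.
divergences = [kl_divergence(r, reference) for r in realizations]
mean = sum(divergences) / len(divergences)
variance = sum((d - mean) ** 2 for d in divergences) / len(divergences)

print("divergences:", divergences)
print("variance:", variance)
```

Under these assumptions, realizations that lie closer to the reference give a smaller spread of divergences, which is the kind of quantity the abstract describes as being smallest for a database tuned to a single speaker.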
About the authors
V. Savchenko
Nizhny Novgorod State Linguistic University
Email: avsavchenko@hse.ru
Russia, ul. Minina 31a, Nizhny Novgorod, 603155
A. Savchenko
National Research University Higher School of Economics
Corresponding author.
Email: avsavchenko@hse.ru
Russia, Bol’shaya Pecherskaya ul. 25/12, Nizhny Novgorod, 603155