Information-theoretic analysis of efficiency of the phonetic encoding–decoding method in automatic speech recognition

V. V. Savchenko; A. V. Savchenko

doi:10.1134/S1064226916040112

Information-theoretic analysis of efficiency of the phonetic encoding–decoding method in automatic speech recognition

作者: Savchenko V.V.¹, Savchenko A.V.²
隶属关系:
1. Nizhny Novgorod State Linguistic University
2. National Research University Higher School of Economics
期: 卷 61, 编号 4 (2016)
页面: 430-435
栏目: Theory and Methods of Signal Processing
URL: https://journals.rcsi.science/1064-2269/article/view/196922
DOI: https://doi.org/10.1134/S1064226916040112
ID: 196922

如何引用文章

全文:

开放存取

##reader.subscriptionAccessGranted##
受限制的访问

订阅存取

详细
作者简介
参考
补充文件
统计

详细

A words phonetic decoding method in automatic speech recognition is considered. The properties of Kullback–Leibler divergence are used to synthesize the estimation of the distribution of divergence between minimum speech units (e.g., single phonemes) inside a single class. It is demonstrated that the minimum variance of the intraphonemic divergence is reached when the phonetic database is tuned to the voice of a single speaker. The estimations are proven by experimental results on the recognition of vowel sounds and isolated words of Russian language.

关键词

Speech Perception, Automatic Speech Recognition, Automatic Speech Recognition System, Voice Signal, Single Speaker

作者简介

V. Savchenko

Nizhny Novgorod State Linguistic University

Email: avsavchenko@hse.ru
俄罗斯联邦, ul. Minina 31a, Nizhny Novgorod, 603155

A. Savchenko

National Research University Higher School of Economics

编辑信件的主要联系方式.
Email: avsavchenko@hse.ru
俄罗斯联邦, Bol’shaya Pecherskaya ul. 25/12, Nizhny Novgorod, 603155

补充文件

附件文件

动作

1. JATS XML

下载

用户名
密码
记住我

忘记您的密码?	注册

用户名
密码
记住我

忘记您的密码?	注册