Constructing a speech audio–video corpus by aligning long segments of speech and text
- 作者: Karpukhin I.A.1, Konushin A.S.1
-
隶属关系:
- Department of Computational Mathematics and Cybernetics
- 期: 卷 41, 编号 2 (2017)
- 页面: 97-103
- 栏目: Article
- URL: https://journals.rcsi.science/0278-6419/article/view/176185
- DOI: https://doi.org/10.3103/S0278641917020030
- ID: 176185
如何引用文章
详细
A new algorithm for aligning text with speech audio signals having lengths of up to several hours is proposed. The algorithm allows its quality to be effectively evaluated. The requirements on the acoustic model are not very demanding. The algorithm can be used to design an audio–video course for learning the Russian language.
作者简介
I. Karpukhin
Department of Computational Mathematics and Cybernetics
编辑信件的主要联系方式.
Email: karpuhini@yandex.ru
俄罗斯联邦, Moscow, 119991
A. Konushin
Department of Computational Mathematics and Cybernetics
Email: karpuhini@yandex.ru
俄罗斯联邦, Moscow, 119991
补充文件
