Constructing a speech audio–video corpus by aligning long segments of speech and text
- Authors: Karpukhin I.A.1, Konushin A.S.1
- Affiliations:
  - Department of Computational Mathematics and Cybernetics
- Issue: Vol. 41, No. 2 (2017)
- Pages: 97-103
- Section: Article
- URL: https://journals.rcsi.science/0278-6419/article/view/176185
- DOI: https://doi.org/10.3103/S0278641917020030
- ID: 176185
Abstract
A new algorithm is proposed for aligning text with speech audio signals up to several hours long. The quality of the resulting alignment can be evaluated effectively, and the algorithm places only modest requirements on the acoustic model. It can be used to build an audio–video course for learning the Russian language.
About the authors
I. Karpukhin
Department of Computational Mathematics and Cybernetics
Corresponding author.
Email: karpuhini@yandex.ru
Russia, Moscow, 119991
A. Konushin
Department of Computational Mathematics and Cybernetics
Email: karpuhini@yandex.ru
Russia, Moscow, 119991
