Constructing a speech audio–video corpus by aligning long segments of speech and text

I. A. Karpukhin; A. S. Konushin

doi:10.3103/S0278641917020030

Authors: Karpukhin I.A.¹, Konushin A.S.¹
Affiliations:
1. Department of Computational Mathematics and Cybernetics
Issue: Vol. 41, No. 2 (2017)
Pages: 97-103
Section: Article
URL: https://journals.rcsi.science/0278-6419/article/view/176185
DOI: https://doi.org/10.3103/S0278641917020030
ID: 176185

Abstract

A new algorithm is proposed for aligning text with speech audio signals up to several hours long. The algorithm allows the quality of its output to be evaluated effectively, and it places only modest requirements on the acoustic model. It can be used to design an audio–video course for learning the Russian language.

Keywords

aligning speech and text, audio–visual speech recognition

About the authors

I. Karpukhin

Department of Computational Mathematics and Cybernetics

Corresponding author.
Email: karpuhini@yandex.ru
Russia, Moscow, 119991

A. Konushin

Department of Computational Mathematics and Cybernetics

Email: karpuhini@yandex.ru
Russia, Moscow, 119991
