Constructing a speech audio–video corpus by aligning long segments of speech and text

I. A. Karpukhin; A. S. Konushin

doi:10.3103/S0278641917020030

Authors: Karpukhin I.A.¹, Konushin A.S.¹
Affiliations:
1. Department of Computational Mathematics and Cybernetics
Issue: Vol. 41, No. 2 (2017)
Pages: 97-103
Section: Article
URL: https://journals.rcsi.science/0278-6419/article/view/176185
DOI: https://doi.org/10.3103/S0278641917020030
ID: 176185

Abstract

A new algorithm for aligning text with speech audio signals having lengths of up to several hours is proposed. The algorithm allows its quality to be effectively evaluated. The requirements on the acoustic model are not very demanding. The algorithm can be used to design an audio–video course for learning the Russian language.

Keywords

aligning speech and text, audio–visual speech recognition

About the authors

I. Karpukhin

Department of Computational Mathematics and Cybernetics

Corresponding author.
Email: karpuhini@yandex.ru
Russia, Moscow, 119991

A. Konushin

Department of Computational Mathematics and Cybernetics

Email: karpuhini@yandex.ru
Russia, Moscow, 119991

