🔧На сайте запланированы технические работы
25.12.2025 в промежутке с 18:00 до 21:00 по Московскому времени (GMT+3) на сайте будут проводиться плановые технические работы. Возможны перебои с доступом к сайту. Приносим извинения за временные неудобства. Благодарим за понимание!
🔧Site maintenance is scheduled.
Scheduled maintenance will be performed on the site from 6:00 PM to 9:00 PM Moscow time (GMT+3) on December 25, 2025. Site access may be interrupted. We apologize for the inconvenience. Thank you for your understanding!

 

Constructing a speech audio–video corpus by aligning long segments of speech and text


Citar

Texto integral

Acesso aberto Acesso aberto
Acesso é fechado Acesso está concedido
Acesso é fechado Somente assinantes

Resumo

A new algorithm for aligning text with speech audio signals having lengths of up to several hours is proposed. The algorithm allows its quality to be effectively evaluated. The requirements on the acoustic model are not very demanding. The algorithm can be used to design an audio–video course for learning the Russian language.

Sobre autores

I. Karpukhin

Department of Computational Mathematics and Cybernetics

Autor responsável pela correspondência
Email: karpuhini@yandex.ru
Rússia, Moscow, 119991

A. Konushin

Department of Computational Mathematics and Cybernetics

Email: karpuhini@yandex.ru
Rússia, Moscow, 119991

Arquivos suplementares

Arquivos suplementares
Ação
1. JATS XML

Declaração de direitos autorais © Allerton Press, Inc., 2017