Constructing a speech audio–video corpus by aligning long segments of speech and text


Cite item

Full Text

Open Access Open Access
Restricted Access Access granted
Restricted Access Subscription Access

Abstract

A new algorithm for aligning text with speech audio signals having lengths of up to several hours is proposed. The algorithm allows its quality to be effectively evaluated. The requirements on the acoustic model are not very demanding. The algorithm can be used to design an audio–video course for learning the Russian language.

About the authors

I. A. Karpukhin

Department of Computational Mathematics and Cybernetics

Author for correspondence.
Email: karpuhini@yandex.ru
Russian Federation, Moscow, 119991

A. S. Konushin

Department of Computational Mathematics and Cybernetics

Email: karpuhini@yandex.ru
Russian Federation, Moscow, 119991

Supplementary files

Supplementary Files
Action
1. JATS XML

Copyright (c) 2017 Allerton Press, Inc.