Speech Signal Segmentation into Vocalized and Unvocalized Segments on the Basis of Simultaneous Masking
- 作者: Konev A.A.1, Meshcheryakov R.V.1, Kostyuchenko E.Y.1
-
隶属关系:
- Tomsk State University of Control Systems and Radio-Electronics
- 期: 卷 54, 编号 4 (2018)
- 页面: 361-366
- 栏目: Analysis and Synthesis of Signals and Images
- URL: https://journals.rcsi.science/8756-6990/article/view/212508
- DOI: https://doi.org/10.3103/S8756699018040076
- ID: 212508
如何引用文章
详细
This paper touches upon a model of simultaneous acoustic masking, which detects speech signal components perceived by a human’s auditory system. A simultaneous masking algorithm on the basis of this model is proposed. It is shown that, after simultaneous masking, a signal becomes a binary structure that reflects the harmonic structure of a vocalized sequence. It is experimentally proven that this structure can be used to detect key speech segments (from the standpoint of perception by an auditory system). This structure serves as a basis for an algorithm of high-quality segmentation of a speech signal into vocalized and unvocalized segments, which does not require learning before use. The joint use of the algorithms for simultaneous masking and speech signal segmentation is tested, and their performance is evaluated.
作者简介
A. Konev
Tomsk State University of Control Systems and Radio-Electronics
Email: key@keva.tusur.ru
俄罗斯联邦, pr. Lenina 40, Tomsk, 634050
R. Meshcheryakov
Tomsk State University of Control Systems and Radio-Electronics
Email: key@keva.tusur.ru
俄罗斯联邦, pr. Lenina 40, Tomsk, 634050
E. Kostyuchenko
Tomsk State University of Control Systems and Radio-Electronics
编辑信件的主要联系方式.
Email: key@keva.tusur.ru
俄罗斯联邦, pr. Lenina 40, Tomsk, 634050
补充文件
