Multisource Speech Analysis for Speaker Recognition


如何引用文章

全文:

开放存取 开放存取
受限制的访问 ##reader.subscriptionAccessGranted##
受限制的访问 订阅存取

详细

On a comprehensive speech database, speaker recognition characteristics are compared under the usage of various voice-source models. Inverse problems to find a source via vowel speech segments are solved on the base of a special speech-production model and voice-source models (A-source, piecewise-linear source, nonparametric source, and source found by means of the spectral relation method). In the first stage, we find the pulses such that the relative residuals of their segmented and their theoretical analogs computed by means of the speech-production model are less than 0.25. For the selected pulses, a posteriori estimates of the error of their determining are computed and the final selection of the source pulses is performed: for the recognition procedure, we leave only pulses with a posteriori estimates of the error less than the accepted level 0.3. In the space of parameters found for each source model, a statistical model is created for each speaker and the recognition is performed. For the speaker recognition with respect to one vowel, the mean error is approximately equal to 66% for the piecewise-linear source, 61% for the spectral relation method, and 33% for the A-source.

作者简介

V. Sorokin

Institute for Information Transmission Problems

编辑信件的主要联系方式.
Email: vns@iitp.ru
俄罗斯联邦, Bol’shoi Karetnyi per. 19, Moscow, 127994

A. Leonov

National Research Nuclear University MEPhI

编辑信件的主要联系方式.
Email: asleonov@mephi.ru
俄罗斯联邦, Kashirskoe sh. 31, Moscow, 115409

补充文件

附件文件
动作
1. JATS XML

版权所有 © Pleiades Publishing, Ltd., 2019