Detection of HMM Synthesized Speech by Wavelet Logarithmic Spectrum


如何引用文章

全文:

开放存取 开放存取
受限制的访问 ##reader.subscriptionAccessGranted##
受限制的访问 订阅存取

详细

Automatic speaker verification systems have achieved great performance and been widely adopted in many security applications. One of the important requirements for the verification system is its resilience to spoofing attacks, such as impersonation, replay, speech synthesis and voice conversion. Among these attacks, speech synthesis has a high risk to the verification systems. In this paper, a novel detection method for computer-generated speech, especially for HMM synthetic speech, is proposed. It is found that the wavelet coefficients in specified position show the obvious difference between the synthetic and natural speech. The logarithmic spectrum features are extracted from the wavelet coefficients and support vector machine is used as the classifier to evaluate the performance of our proposed algorithm. The experimental results over SAS corpus show that the proposed algorithm can achieve high detection accuracy and low equal error rate.

作者简介

Diqun Yan

College of Information Science and Engineering, Ningbo University; Guangdong Key Laboratory of Intelligent Information Processing and Shenzhen Key Laboratory of Media Security

编辑信件的主要联系方式.
Email: yandiqun@nbu.edu.cn
中国, Ningbo, 315211; Shenzhen, 518060

Li Xiang

College of Information Science and Engineering, Ningbo University

Email: yandiqun@nbu.edu.cn
中国, Ningbo, 315211

Zhifeng Wang

College of Information Science and Engineering, Ningbo University

Email: yandiqun@nbu.edu.cn
中国, Ningbo, 315211

Rangding Wang

College of Information Science and Engineering, Ningbo University

Email: yandiqun@nbu.edu.cn
中国, Ningbo, 315211

补充文件

附件文件
动作
1. JATS XML

版权所有 © Allerton Press, Inc., 2019