Spectral Phase Estimation Based on Deep Neural Networks for Single Channel Speech Enhancement


如何引用文章

全文:

开放存取 开放存取
受限制的访问 ##reader.subscriptionAccessGranted##
受限制的访问 订阅存取

详细

Majority of speech processing algorithms operate only with the spectral magnitude, leaving spectral phase unstructured and unexplored. With recent advancement in deep neural networks (DNNs), the phase processing became more important as an innovative and emergent prospective of the DNN based speech enhancement. In this paper, a speech enhancement method based on DNN combined with spectral phase estimation is proposed to improve the quality and intelligibility of the noisy speech. During training, DNNs are trained to learn a mapping from the noisy speech utterances and predict the coefficient to construct an ideal ratio mask for the spectral magnitude. The temporal smoothing unwrapped spectral phase estimation is incorporated as a target and transformed into a structured spectral phase during signal reconstruction. In enhancement stage, the enhanced speech magnitude is reconstructed with estimated structured spectral phase. Experimental results demonstrate success of the proposed method for speech enhancement in terms of the speech quality and intelligibility.

作者简介

N. Saleem

Department of Electrical Engineering, University of Engineering and Technology Peshawar; Department of Electrical Engineering, FET, Gomal University

编辑信件的主要联系方式.
Email: nasirsaleem@gu.edu.pk
巴基斯坦, Khyber Pakhtunkhva; D.I. Khan, Dera Ismail Khan

M. Khattak

Department of Electrical Engineering, University of Engineering and Technology Peshawar

编辑信件的主要联系方式.
Email: m.i.khattak@uetpeshawar.edu.pk
巴基斯坦, Khyber Pakhtunkhva

E. Perez

School of Engineering and Technology, Universidad Internacional de La Rioja (UNIR)

编辑信件的主要联系方式.
Email: elena.verdu@unir.net
西班牙, Logroño, La Rioja, 26006


版权所有 © Pleiades Publishing, Inc., 2019
##common.cookie##