Spectral Phase Estimation Based on Deep Neural Networks for Single Channel Speech Enhancement
- Authors: Saleem N.1,2, Khattak M.I.1, Perez E.V.3
-
Affiliations:
- Department of Electrical Engineering, University of Engineering and Technology Peshawar
- Department of Electrical Engineering, FET, Gomal University
- School of Engineering and Technology, Universidad Internacional de La Rioja (UNIR)
- Issue: Vol 64, No 12 (2019)
- Pages: 1372-1382
- Section: Theory and Methods of Signal Processing
- URL: https://journals.rcsi.science/1064-2269/article/view/201634
- DOI: https://doi.org/10.1134/S1064226919120155
- ID: 201634
Cite item
Abstract
Majority of speech processing algorithms operate only with the spectral magnitude, leaving spectral phase unstructured and unexplored. With recent advancement in deep neural networks (DNNs), the phase processing became more important as an innovative and emergent prospective of the DNN based speech enhancement. In this paper, a speech enhancement method based on DNN combined with spectral phase estimation is proposed to improve the quality and intelligibility of the noisy speech. During training, DNNs are trained to learn a mapping from the noisy speech utterances and predict the coefficient to construct an ideal ratio mask for the spectral magnitude. The temporal smoothing unwrapped spectral phase estimation is incorporated as a target and transformed into a structured spectral phase during signal reconstruction. In enhancement stage, the enhanced speech magnitude is reconstructed with estimated structured spectral phase. Experimental results demonstrate success of the proposed method for speech enhancement in terms of the speech quality and intelligibility.
About the authors
N. Saleem
Department of Electrical Engineering, University of Engineering and Technology Peshawar; Department of Electrical Engineering, FET, Gomal University
Author for correspondence.
Email: nasirsaleem@gu.edu.pk
Pakistan, Khyber Pakhtunkhva; D.I. Khan, Dera Ismail Khan
M. I. Khattak
Department of Electrical Engineering, University of Engineering and Technology Peshawar
Author for correspondence.
Email: m.i.khattak@uetpeshawar.edu.pk
Pakistan, Khyber Pakhtunkhva
E. V. Perez
School of Engineering and Technology, Universidad Internacional de La Rioja (UNIR)
Author for correspondence.
Email: elena.verdu@unir.net
Spain, Logroño, La Rioja, 26006
![](/img/style/loading.gif)