Spectral Phase Estimation Based on Deep Neural Networks for Single Channel Speech Enhancement
- Autores: Saleem N.1,2, Khattak M.1, Perez E.3
-
Afiliações:
- Department of Electrical Engineering, University of Engineering and Technology Peshawar
- Department of Electrical Engineering, FET, Gomal University
- School of Engineering and Technology, Universidad Internacional de La Rioja (UNIR)
- Edição: Volume 64, Nº 12 (2019)
- Páginas: 1372-1382
- Seção: Theory and Methods of Signal Processing
- URL: https://journals.rcsi.science/1064-2269/article/view/201634
- DOI: https://doi.org/10.1134/S1064226919120155
- ID: 201634
Citar
Resumo
Majority of speech processing algorithms operate only with the spectral magnitude, leaving spectral phase unstructured and unexplored. With recent advancement in deep neural networks (DNNs), the phase processing became more important as an innovative and emergent prospective of the DNN based speech enhancement. In this paper, a speech enhancement method based on DNN combined with spectral phase estimation is proposed to improve the quality and intelligibility of the noisy speech. During training, DNNs are trained to learn a mapping from the noisy speech utterances and predict the coefficient to construct an ideal ratio mask for the spectral magnitude. The temporal smoothing unwrapped spectral phase estimation is incorporated as a target and transformed into a structured spectral phase during signal reconstruction. In enhancement stage, the enhanced speech magnitude is reconstructed with estimated structured spectral phase. Experimental results demonstrate success of the proposed method for speech enhancement in terms of the speech quality and intelligibility.
Palavras-chave
Sobre autores
N. Saleem
Department of Electrical Engineering, University of Engineering and Technology Peshawar; Department of Electrical Engineering, FET, Gomal University
Autor responsável pela correspondência
Email: nasirsaleem@gu.edu.pk
Paquistão, Khyber Pakhtunkhva; D.I. Khan, Dera Ismail Khan
M. Khattak
Department of Electrical Engineering, University of Engineering and Technology Peshawar
Autor responsável pela correspondência
Email: m.i.khattak@uetpeshawar.edu.pk
Paquistão, Khyber Pakhtunkhva
E. Perez
School of Engineering and Technology, Universidad Internacional de La Rioja (UNIR)
Autor responsável pela correspondência
Email: elena.verdu@unir.net
Espanha, Logroño, La Rioja, 26006