Whispered speech recognition based on gammatone filterbank cepstral coefficients


Citar

Texto integral

Acesso aberto Acesso aberto
Acesso é fechado Acesso está concedido
Acesso é fechado Somente assinantes

Resumo

This paper presents the results on whispered speech recognition using gammatone filterbank cepstral coefficients for speaker dependent mode. The isolated words used for this experiment are taken from the Whi-Spe database. Whispered speech recognition is based on dynamic time warping and hidden Markov models methods. The experiments are focused on the following modes: normal speech, whispered speech and their combinations (normal/whispered and whispered/normal). The results demonstrated an important improvement in recognition after application of cepstral mean subtraction, especially in mixed train/test scenarios.

Sobre autores

B. Marković

Telecommunication Department, School of Electrical Engineering

Autor responsável pela correspondência
Email: brankomarko@yahoo.com
Sérvia, Belgrade, 11000

J. Galić

Telecommunication Department, School of Electrical Engineering

Email: brankomarko@yahoo.com
Sérvia, Belgrade, 11000

Ð. Grozdić

Telecommunication Department, School of Electrical Engineering

Email: brankomarko@yahoo.com
Sérvia, Belgrade, 11000

S. Jovičić

Telecommunication Department, School of Electrical Engineering

Email: brankomarko@yahoo.com
Sérvia, Belgrade, 11000

M. Mijić

Telecommunication Department, School of Electrical Engineering

Email: brankomarko@yahoo.com
Sérvia, Belgrade, 11000


Declaração de direitos autorais © Pleiades Publishing, Inc., 2017

Este site utiliza cookies

Ao continuar usando nosso site, você concorda com o procedimento de cookies que mantêm o site funcionando normalmente.

Informação sobre cookies