Whispered speech recognition based on gammatone filterbank cepstral coefficients


Abstract

This paper presents results on whispered speech recognition using gammatone filterbank cepstral coefficients in speaker-dependent mode. The isolated words used in the experiments are taken from the Whi-Spe database. Recognition is based on two methods: dynamic time warping and hidden Markov models. The experiments cover four train/test scenarios: normal speech, whispered speech, and their combinations (normal/whispered and whispered/normal). The results demonstrate a considerable improvement in recognition after applying cepstral mean subtraction, especially in the mixed train/test scenarios.
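As an illustration of two techniques named in the abstract, the following is a minimal Python sketch of cepstral mean subtraction followed by a dynamic time warping comparison. It is not the authors' implementation: the function names, the Euclidean local distance, the unconstrained warping path, and the assumed (frames x coefficients) feature layout are all assumptions made here for illustration, and the computation of the gammatone cepstral coefficients themselves is not shown.

```python
import numpy as np


def cepstral_mean_subtraction(features):
    """Subtract the per-coefficient mean over time.

    `features` is assumed to be a (frames x coefficients) array of
    cepstral coefficients for one utterance (hypothetical layout).
    """
    features = np.asarray(features, dtype=float)
    return features - features.mean(axis=0, keepdims=True)


def dtw_distance(a, b):
    """Accumulated DTW cost between two feature sequences.

    Uses a Euclidean local distance and the basic three-way step
    pattern, with no path constraints (an illustrative choice).
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = np.linalg.norm(a[i - 1] - b[j - 1])
            cost[i, j] = local + min(
                cost[i - 1, j],      # insertion
                cost[i, j - 1],      # deletion
                cost[i - 1, j - 1],  # match
            )
    return cost[n, m]
```

In a template-matching recognizer of this kind, a test word would typically be assigned the label of the reference template with the smallest DTW cost. Because mean subtraction removes any constant offset in the cepstral domain, two utterances that differ only by such an offset compare as identical after normalization, which hints at why the technique can help when training and test conditions differ.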

About the authors

B. Marković

Telecommunication Department, School of Electrical Engineering

Author for correspondence.
Email: brankomarko@yahoo.com
Serbia, Belgrade, 11000

J. Galić

Telecommunication Department, School of Electrical Engineering

Serbia, Belgrade, 11000

Đ. Grozdić

Telecommunication Department, School of Electrical Engineering

Serbia, Belgrade, 11000

S. T. Jovičić

Telecommunication Department, School of Electrical Engineering

Serbia, Belgrade, 11000

M. Mijić

Telecommunication Department, School of Electrical Engineering

Serbia, Belgrade, 11000


Copyright (c) 2017 Pleiades Publishing, Inc.
