Whispered speech recognition based on gammatone filterbank cepstral coefficients

B. Marković; J. Galić; Ð. Grozdić; S. T. Jovičić; M. Mijić

doi:10.1134/S1064226917110134

Whispered speech recognition based on gammatone filterbank cepstral coefficients

Authors: Marković B.¹, Galić J.¹, Grozdić Ð.¹, Jovičić S.T.¹, Mijić M.¹
Affiliations:
1. Telecommunication Department, School of Electrical Engineering
Issue: Vol 62, No 11 (2017)
Pages: 1255-1261
Section: Theory and Methods of Signal Processing
URL: https://journals.rcsi.science/1064-2269/article/view/198953
DOI: https://doi.org/10.1134/S1064226917110134
ID: 198953

Cite item

Full Text

Open Access
Restricted Access

Access granted
Restricted Access

Subscription Access

Abstract
About the authors
References
Supplementary files
Statistics

Abstract

This paper presents the results on whispered speech recognition using gammatone filterbank cepstral coefficients for speaker dependent mode. The isolated words used for this experiment are taken from the Whi-Spe database. Whispered speech recognition is based on dynamic time warping and hidden Markov models methods. The experiments are focused on the following modes: normal speech, whispered speech and their combinations (normal/whispered and whispered/normal). The results demonstrated an important improvement in recognition after application of cepstral mean subtraction, especially in mixed train/test scenarios.

Supplementary files

Supplementary Files

Action

1. JATS XML

Download

Username
Password
Remember me

Forgot password?	Register

Username
Password
Remember me

Forgot password?	Register

Whispered speech recognition based on gammatone filterbank cepstral coefficients

Full Text

Abstract

About the authors

B. Marković

J. Galić

Ð. Grozdić

S. T. Jovičić

M. Mijić

Supplementary files