Speaker Modeling Using Emotional Speech for More Robust Speaker Identification

M. Milošević; Ž. Nedeljković; U. Glavitsch; Ž. Đurović

doi:10.1134/S1064226919110184

Authors: Milošević M.¹, Nedeljković Ž.¹, Glavitsch U.², Đurović Ž.¹
Affiliations:
1. School of Electrical Engineering, University of Belgrade
2. GlavitschEggler Software
Issue: Vol. 64, No. 11 (2019)
Pages: 1256-1265
Section: Theory and Methods of Signal Processing
URL: https://journals.rcsi.science/1064-2269/article/view/201547
DOI: https://doi.org/10.1134/S1064226919110184
ID: 201547

Abstract

Automatic identity recognition in a fast, reliable, and non-intrusive way is one of the most challenging topics in today's digital world. One possible approach is identification by voice. The speech characteristics relevant to automatic speaker recognition can be affected by external factors such as noise and channel distortions, but also by speaker-specific conditions such as emotional or health states. This paper addresses the improvement of a speaker recognition system through different model training strategies, with the aim of obtaining the best system performance from only a limited amount of neutral and emotional speech data. The models adopted are a Gaussian Mixture Model and i-vectors, with Mel Frequency Cepstral Coefficients as input features, and the experiments were conducted on the Russian Language Affective speech database. The results show that the appropriate use of emotional speech in speaker model training improves the robustness of a speaker recognition system, both when tested on neutral speech and when tested on emotional speech.

Keywords

emotion recognition, Gaussian mixture models, i-vectors, human voice, identification of persons, speaker recognition
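
The idea of training speaker models on both neutral and emotional speech can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the training strategies, model sizes, or scoring. The sketch assumes a small diagonal-covariance Gaussian Mixture Model fitted with plain EM, uses synthetic 2-D vectors as a stand-in for Mel Frequency Cepstral Coefficient frames, and picks the speaker whose model gives the highest average log-likelihood. All names (`train_gmm`, `identify`, `spk_A`, `spk_B`) and all numeric values are hypothetical.

```python
import math
import random

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))

def logsumexp(vals):
    """Numerically stable log(sum(exp(v))) over a list of values."""
    top = max(vals)
    return top + math.log(sum(math.exp(v - top) for v in vals))

def train_gmm(frames, n_comp=2, n_iter=20, seed=0, var_floor=1e-3):
    """Fit a diagonal GMM with EM; returns a list of (weight, mean, var)."""
    rng = random.Random(seed)
    dim = len(frames[0])
    # Initialise means from randomly chosen frames, with unit variances.
    gmm = [(1.0 / n_comp, list(f), [1.0] * dim)
           for f in rng.sample(frames, n_comp)]
    for _ in range(n_iter):
        # E-step: responsibility of each component for each frame.
        resp = []
        for x in frames:
            logs = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
            norm = logsumexp(logs)
            resp.append([math.exp(l - norm) for l in logs])
        # M-step: re-estimate weights, means, and floored variances.
        new_gmm = []
        for k in range(n_comp):
            nk = sum(r[k] for r in resp) + 1e-10
            mean = [sum(r[k] * x[d] for r, x in zip(resp, frames)) / nk
                    for d in range(dim)]
            var = [max(var_floor,
                       sum(r[k] * (x[d] - mean[d]) ** 2
                           for r, x in zip(resp, frames)) / nk)
                   for d in range(dim)]
            new_gmm.append((nk / len(frames), mean, var))
        gmm = new_gmm
    return gmm

def avg_loglik(gmm, frames):
    """Average per-frame log-likelihood of the frames under the GMM."""
    return sum(logsumexp([math.log(w) + log_gauss_diag(x, m, v)
                          for w, m, v in gmm])
               for x in frames) / len(frames)

def identify(models, frames):
    """Return the speaker whose model scores the test frames highest."""
    return max(models, key=lambda spk: avg_loglik(models[spk], frames))

def synth_frames(center, n, rng, std=0.5):
    """Synthetic stand-in for MFCC frames: Gaussian noise around a center."""
    return [[rng.gauss(c, std) for c in center] for _ in range(n)]

rng = random.Random(1)
# Hypothetical (neutral, emotional) feature centers for two speakers.
centers = {
    "spk_A": ((0.0, 0.0), (0.0, 2.0)),
    "spk_B": ((4.0, 0.0), (4.0, 2.0)),
}

# Training strategy sketched here: pool neutral and emotional frames.
models = {}
for spk, (neutral, emotional) in centers.items():
    pooled = synth_frames(neutral, 60, rng) + synth_frames(emotional, 60, rng)
    models[spk] = train_gmm(pooled)

test_emotional = synth_frames(centers["spk_A"][1], 30, rng)
test_neutral = synth_frames(centers["spk_B"][0], 30, rng)
print(identify(models, test_emotional), identify(models, test_neutral))
```

In a real system the synthetic frames would be replaced by MFCC vectors extracted from recordings, and the number of mixture components would be far larger. Pooling is only one conceivable strategy; the sketch makes no claim about which strategy the paper found best.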
