Automated definition of phonetically homogeneous sections of words in a natural language based on multiparameter optimization
- Авторы: Korsun O.1, Poliev A.1
-
Учреждения:
- State Research Institute of Aviation Systems (FGUP GosNIIAS)
- Выпуск: Том 55, № 4 (2016)
- Страницы: 609-618
- Раздел: Pattern Recognition and Image Processing
- URL: https://journals.rcsi.science/1064-2307/article/view/219692
- DOI: https://doi.org/10.1134/S1064230716040080
- ID: 219692
Цитировать
Аннотация
An approach to the automated splitting of words into phonetically homogeneous parts is proposed under which the boundaries of the parts are defined as a result of solving a multiparameter optimization problem. The approach is assumed to ensure the maximum difference in the phonetic material between the adjacent parts and the maximum similarity within the parts. The accepted measure of similarity and difference is based on the correlation between the columns of the parametric portrait matrix of the word generated as a result of a time-spectral conversion of an audio recording of the word. To obtain a numerical solution of the problem, an algorithm is proposed which is a modification of a dynamic programming technique. The experimental results are presented with several words from the Russian language taken as examples to confirm the legitimacy of the assumptions made and viability of the algorithms proposed.
Об авторах
O. Korsun
State Research Institute of Aviation Systems (FGUP GosNIIAS)
Автор, ответственный за переписку.
Email: marmotto@rambler.ru
Россия, Moscow
A. Poliev
State Research Institute of Aviation Systems (FGUP GosNIIAS)
Email: marmotto@rambler.ru
Россия, Moscow
![](/img/style/loading.gif)