Applying Time Series for Background User Identification Based on Their Text Data Analysis
- Авторы: Korolev V.Y.1, Korchagin A.Y.1, Mashechkin I.V.1, Petrovskii M.I.1, Tsarev D.V.1
-
Учреждения:
- Faculty of Computational Mathematics and Cybernetics, Lomonosov State University
- Выпуск: Том 44, № 5 (2018)
- Страницы: 353-362
- Раздел: Article
- URL: https://journals.rcsi.science/0361-7688/article/view/176666
- DOI: https://doi.org/10.1134/S0361768818050055
- ID: 176666
Цитировать
Аннотация
An approach to user identification based on deviations of their topic trends in operation with text information is presented. An approach is proposed to solve this problem; the approach implies topic analysis of the user’s past trends (behavior) in operation with text content of various (including confidential) categories and forecast of their future behavior. The topic analysis of user’s operation implies determining the principal topics of their text content and calculating their respective weights at the given instants. Deviations in the behavior in the user’s operation with the content from the forecast are used to identify this user. In the framework of this approach, our own original time series forecasting method is proposed based on orthogonal non-negative matrix factorization (ONMF). Note that ONMF has not been used to solve time series forecasting problems before. The experimental research held on the example of real-world corporate emailing formed out of the Enron data set showed the proposed user identification approach to be applicable.
Об авторах
V. Korolev
Faculty of Computational Mathematics and Cybernetics, Lomonosov State University
Автор, ответственный за переписку.
Email: bruce27@yandex.ru
Россия, Moscow, 119991
A. Korchagin
Faculty of Computational Mathematics and Cybernetics, Lomonosov State University
Автор, ответственный за переписку.
Email: proton.ru@gmail.com
Россия, Moscow, 119991
I. Mashechkin
Faculty of Computational Mathematics and Cybernetics, Lomonosov State University
Автор, ответственный за переписку.
Email: mash@cs.msu.su
Россия, Moscow, 119991
M. Petrovskii
Faculty of Computational Mathematics and Cybernetics, Lomonosov State University
Автор, ответственный за переписку.
Email: michael@cs.msu.su
Россия, Moscow, 119991
D. Tsarev
Faculty of Computational Mathematics and Cybernetics, Lomonosov State University
Автор, ответственный за переписку.
Email: tsarev@cs.msu.su
Россия, Moscow, 119991
Дополнительные файлы
