Applying Time Series for Background User Identification Based on Their Text Data Analysis


Abstract

An approach to user identification is presented that relies on deviations in the topic trends of a user's work with text information. The approach involves topic analysis of the user's past behavior when working with text content of various categories (including confidential ones) and a forecast of their future behavior. Topic analysis here means determining the principal topics of the user's text content and calculating their respective weights at given instants. Deviations of the user's actual behavior from the forecast are then used to identify the user. Within this approach, an original time series forecasting method based on orthogonal non-negative matrix factorization (ONMF) is proposed. Note that ONMF has not previously been used to solve time series forecasting problems. Experiments on real-world corporate email drawn from the Enron data set showed that the proposed user identification approach is applicable.
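The forecast-and-compare step described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the ONMF-based forecaster is not specified here, so simple exponential smoothing is swapped in as a stand-in, and the topic weights are assumed to be already computed. The function names, the `alpha` and `threshold` parameters, the user names, and the sample data are all hypothetical.

```python
from math import sqrt


def forecast_weights(history, alpha=0.5):
    """Forecast the next topic-weight vector from past ones.

    ASSUMPTION: simple exponential smoothing per topic, used only as a
    stand-in for the ONMF-based forecaster named in the abstract.
    `history` is a list of weight vectors, oldest first.
    """
    level = list(history[0])
    for weights in history[1:]:
        level = [alpha * w + (1 - alpha) * l for w, l in zip(weights, level)]
    return level


def deviation(forecast, observed):
    """Euclidean distance between forecast and observed topic weights."""
    return sqrt(sum((f - o) ** 2 for f, o in zip(forecast, observed)))


def matches_profile(history, observed, threshold=0.2, alpha=0.5):
    """True if observed behavior stays within `threshold` of the forecast.

    ASSUMPTION: a fixed threshold is a hypothetical decision rule.
    """
    return deviation(forecast_weights(history, alpha), observed) <= threshold


def identify_user(histories, observed, alpha=0.5):
    """Return the user whose forecast is closest to the observed weights."""
    scores = {
        user: deviation(forecast_weights(h, alpha), observed)
        for user, h in histories.items()
    }
    return min(scores, key=scores.get), scores


# Hypothetical data: weights of three topics at three past instants.
histories = {
    "alice": [[0.7, 0.2, 0.1], [0.6, 0.3, 0.1], [0.7, 0.2, 0.1]],
    "bob": [[0.1, 0.2, 0.7], [0.2, 0.1, 0.7], [0.1, 0.1, 0.8]],
}
observed = [0.65, 0.25, 0.10]  # topic weights of the new text content

best, scores = identify_user(histories, observed)
print(best, scores)
```

In this sketch a small deviation means the new content fits the user's forecast topic profile, while a large one suggests the content was produced by someone else. How the paper actually scores deviations and sets decision rules may differ.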

About the authors

V. Yu. Korolev

Faculty of Computational Mathematics and Cybernetics, Lomonosov State University

Author for correspondence.
Email: bruce27@yandex.ru
Russian Federation, Moscow, 119991

A. Yu. Korchagin

Faculty of Computational Mathematics and Cybernetics, Lomonosov State University

Author for correspondence.
Email: proton.ru@gmail.com
Russian Federation, Moscow, 119991

I. V. Mashechkin

Faculty of Computational Mathematics and Cybernetics, Lomonosov State University

Author for correspondence.
Email: mash@cs.msu.su
Russian Federation, Moscow, 119991

M. I. Petrovskii

Faculty of Computational Mathematics and Cybernetics, Lomonosov State University

Author for correspondence.
Email: michael@cs.msu.su
Russian Federation, Moscow, 119991

D. V. Tsarev

Faculty of Computational Mathematics and Cybernetics, Lomonosov State University

Author for correspondence.
Email: tsarev@cs.msu.su
Russian Federation, Moscow, 119991


Copyright (c) 2018 Pleiades Publishing, Ltd.
