Outlier Detection in Complex Structured Event Streams
- Authors: Kazachuk M.A.1, Petrovskiy M.I.1, Mashechkin I.V.1, Gorokhov O.E.1
-
Affiliations:
- Faculty of Computational Mathematics and Cybernetics
- Issue: Vol 43, No 3 (2019)
- Pages: 101-111
- Section: Article
- URL: https://journals.rcsi.science/0278-6419/article/view/176307
- DOI: https://doi.org/10.3103/S0278641919030038
- ID: 176307
Cite item
Abstract
Outlier detection methods are now used extensively, particularly in systems for detecting internal intrusions, in medicine, and in systems for detecting extremism in public political discussions on forums and social media. The aim of this work is to consider a fuzzy method of detecting outliers, based on elliptic clustering in the higher-dimensional space of attributes and using the Mahalanobis metrics for calculating the distances between objects and the center of a cluster. A procedure developed by the authors is used to find the optimum values of metaparameters of this algorithm. The classification of both individual events and complete sessions of user activity is considered, using an algorithm based on Welch’s t-statistics. The proposed procedures display a high quality of operation in solving two important problems of the stream analysis of complex data structures: the authentication of users by keystroke dynamics, and detecting extremist information in web text messages.
About the authors
M. A. Kazachuk
Faculty of Computational Mathematics and Cybernetics
Author for correspondence.
Email: kazachuk@mlab.cs.msu.su
Russian Federation, Moscow, 119991
M. I. Petrovskiy
Faculty of Computational Mathematics and Cybernetics
Author for correspondence.
Email: michael@cs.msu.su
Russian Federation, Moscow, 119991
I. V. Mashechkin
Faculty of Computational Mathematics and Cybernetics
Author for correspondence.
Email: mash@cs.msu.su
Russian Federation, Moscow, 119991
O. E. Gorokhov
Faculty of Computational Mathematics and Cybernetics
Author for correspondence.
Email: owlman995@gmail.com
Russian Federation, Moscow, 119991
Supplementary files
