Outlier Detection in Complex Structured Event Streams


Abstract

Outlier detection methods are now widely used, particularly in systems for detecting insider intrusions, in medicine, and in systems for detecting extremism in public political discussions on forums and social media. This work considers a fuzzy outlier detection method based on elliptic clustering in a high-dimensional attribute space, in which the Mahalanobis metric is used to calculate the distance between each object and the center of a cluster. A procedure developed by the authors is used to find the optimal values of the algorithm's metaparameters. Both individual events and complete sessions of user activity are classified; sessions are classified with an algorithm based on Welch's t-statistic. The proposed procedures perform well on two important problems in the stream analysis of complex data structures: authenticating users by keystroke dynamics and detecting extremist content in web text messages.
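As a rough illustration of the two ingredients named in the abstract, the following minimal sketch scores events by their Mahalanobis distance from a single cluster center and then compares the scores of one session against reference scores using Welch's t-statistic. It is not the authors' fuzzy elliptic clustering algorithm or their metaparameter search: the single Gaussian cluster, the synthetic data, and the function names `mahalanobis_distances` and `welch_t` are all assumptions made for illustration.

```python
import numpy as np

def mahalanobis_distances(X, center, cov):
    """Mahalanobis distance of each row of X from `center` under covariance `cov`."""
    diff = X - center
    # Solve cov @ y = diff.T rather than forming the explicit inverse of cov.
    solved = np.linalg.solve(cov, diff.T).T
    return np.sqrt(np.einsum("ij,ij->i", diff, solved))

def welch_t(a, b):
    """Welch's t-statistic for two samples with possibly unequal variances."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    var_a, var_b = a.var(ddof=1), b.var(ddof=1)
    return (a.mean() - b.mean()) / np.sqrt(var_a / len(a) + var_b / len(b))

# Synthetic data (assumption): "normal" events drawn around the origin.
rng = np.random.default_rng(0)
train = rng.normal(size=(500, 3))
center = train.mean(axis=0)
cov = np.cov(train, rowvar=False)

# Per-event scores for the reference data and for a shifted, anomalous session.
reference = mahalanobis_distances(train, center, cov)
session = rng.normal(loc=3.0, size=(40, 3))
scores = mahalanobis_distances(session, center, cov)

# A large positive value suggests the session's events lie farther from the
# cluster center, on average, than the reference events do.
t = welch_t(scores, reference)
print(f"Welch t-statistic for the session: {t:.2f}")
```

In this sketch a session would be flagged when the statistic exceeds some threshold; how that threshold and the other metaparameters are chosen is exactly what the authors' procedure addresses and is not reproduced here.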

About the authors

M. A. Kazachuk

Faculty of Computational Mathematics and Cybernetics

Author for correspondence.
Email: kazachuk@mlab.cs.msu.su
Russian Federation, Moscow, 119991

M. I. Petrovskiy

Faculty of Computational Mathematics and Cybernetics

Author for correspondence.
Email: michael@cs.msu.su
Russian Federation, Moscow, 119991

I. V. Mashechkin

Faculty of Computational Mathematics and Cybernetics

Author for correspondence.
Email: mash@cs.msu.su
Russian Federation, Moscow, 119991

O. E. Gorokhov

Faculty of Computational Mathematics and Cybernetics

Author for correspondence.
Email: owlman995@gmail.com
Russian Federation, Moscow, 119991

Copyright (c) 2019 Allerton Press, Inc.