A Polynomial-Time Approximation Algorithm for One Problem Simulating the Search in a Time Series for the Largest Subsequence of Similar Elements
- 作者: Kel’manov A.V.1,2, Khamidullin S.A.1, Khandeev V.I.1,2, Pyatkin A.V.1,2, Shamardin Y.V.1, Shenmaier V.V.1
-
隶属关系:
- Sobolev Institute of Mathematics, Siberian Branch
- Novosibirsk State University
- 期: 卷 28, 编号 3 (2018)
- 页面: 363-370
- 栏目: Mathematical Method in Pattern Recognition
- URL: https://journals.rcsi.science/1054-6618/article/view/195384
- DOI: https://doi.org/10.1134/S1054661818030094
- ID: 195384
如何引用文章
详细
We analyze the mathematical aspects of the data analysis problem consisting in the search (selection) for a subset of similar elements in a group of objects. The problem arises, in particular, in connection with the analysis of data in the form of time series (discrete signals). One of the problems in modeling this challenge is considered, namely, the problem of searching in a finite sequence of points from the Euclidean space for the subsequence with the greatest number of terms such that the quadratic spread of the elements of this subsequence with respect to its unknown centroid does not exceed a given percentage of the quadratic spread of elements of the input sequence with respect to its centroid. It is shown that the problem is strongly NP-hard. A polynomial-time approximation algorithm is proposed. This algorithm either establishes that the problem has no solution or finds a 1/2-approximate solution if the length M* of the optimal subsequence is even, or it yields a \(\frac{1}{2}\left(\begin{array}{c}1-\frac{1}{M^*}\\ \end{array}\right)\)-approximate solution if M* is odd. The time complexity of the algorithm is O(N3(N2+q)), where N is the number of points in the input sequence and q is the space dimension. The results of numerical simulation that demonstrate the effectiveness of the algorithm are presented.
作者简介
A. Kel’manov
Sobolev Institute of Mathematics, Siberian Branch; Novosibirsk State University
编辑信件的主要联系方式.
Email: kelm@math.nsc.ru
俄罗斯联邦, pr. Akad. Koptyuga 4, Novosibirsk; ul. Pirogova 2, Novosibirsk
S. Khamidullin
Sobolev Institute of Mathematics, Siberian Branch
Email: kelm@math.nsc.ru
俄罗斯联邦, pr. Akad. Koptyuga 4, Novosibirsk
V. Khandeev
Sobolev Institute of Mathematics, Siberian Branch; Novosibirsk State University
Email: kelm@math.nsc.ru
俄罗斯联邦, pr. Akad. Koptyuga 4, Novosibirsk; ul. Pirogova 2, Novosibirsk
A. Pyatkin
Sobolev Institute of Mathematics, Siberian Branch; Novosibirsk State University
Email: kelm@math.nsc.ru
俄罗斯联邦, pr. Akad. Koptyuga 4, Novosibirsk; ul. Pirogova 2, Novosibirsk
Yu. Shamardin
Sobolev Institute of Mathematics, Siberian Branch
Email: kelm@math.nsc.ru
俄罗斯联邦, pr. Akad. Koptyuga 4, Novosibirsk
V. Shenmaier
Sobolev Institute of Mathematics, Siberian Branch
Email: kelm@math.nsc.ru
俄罗斯联邦, pr. Akad. Koptyuga 4, Novosibirsk
补充文件
