Open Access Open Access  Restricted Access Access granted  Restricted Access Subscription Access

Vol 29, No 4 (2019)

Mathematical Theory of Pattern Recognition

An Accelerated Exact Algorithm for the One-Dimensional M-Variance Problem

Kel’manov A.V., Ruzankin P.S.

Abstract

The known quadratic \(NP\)-hard (in the strong sense) \(M\)-variance problem is considered. It arises in the following typical problem of data analysis: in a set of \(N\) objects determined by their characteristics (features), find a subset of \(M\) elements close to each other. For the one-dimensional case, an accelerated exact algorithm with complexity \(\mathcal{O}(N{\kern 1pt} \log{\kern 1pt} N)\) is proposed.

Pattern Recognition and Image Analysis. 2019;29(4):573-576
pages 573-576 views

Subjective Restoration of Mathematical Models for a Research Object, Its Measurements, and Measurement-Data Interpretation

Pyt’ev Y.P., Falomkina O.V., Shishkin S.A.

Abstract

The problems of empirical reconstruction of the subjective model of a research object (RO), the subjective model of its measurements, their subjective analysis, and subjective interpretation of the measurement data are considered. To solve these problems, we use the mathematical formalism for subjective modeling (MFSM), subjective judgments made by the researcher–modeler (r.–m.) concerning the mathematical model of the RO and its measurements and based on his scientific experience and intuition. The subjective models of measurements of the RO and measurement-data interpretation are defined by the r.–m. as elements of a parametric family of smoothing splines. It is shown that the maximum posterior accuracy of the subjective interpretation of the measurement-experiment data, which is “observed” in the solution process for the problems of restoring the subjective models of the RO and its measurements, analysis, and measurement-data interpretation, can serve as a criterion for the truth of the subjective models of the measurement experiment and interpretation of the obtained measurement data, since the criterion for the accuracy of measurement-data interpretation is not used in the reconstruction of the above models. The paper suggests the principle of the maximum posterior accuracy of the subjective interpretation of measurement-experiment data as a criterion for the adequacy of subjectively reconstructed models of measurement experiments and interpretation of measurement data.

Pattern Recognition and Image Analysis. 2019;29(4):577-591
pages 577-591 views

Kernel-Distance-Based Intuitionistic Fuzzy c-Means Clustering Algorithm and Its Application

Xiangxiao L., Honglin O., Lijuan X.

Abstract

Image segmentation plays an important role in machine vision, image recognition, and imaging applications. Based on the fuzzy c-means clustering algorithm, a kernel-distance-based intuitionistic fuzzy c-means clustering (KIFCM) algorithm is proposed. First, a fuzzy complement operator is used to generate the membership degree whereby the hesitation degree of intuitionistic fuzzy set is generated; second, a kernel-induced function is used to calculate the distance from each point to the cluster center instead of the Euclidean distance; third, a new objective function that includes the hesitation degree is established, and the optimization of the objective function results in new iterative expressions for the membership degree and the cluster center. The proposed KIFCM algorithm is compared with the fuzzy c-means clustering (FCM) algorithm, the kernel fuzzy c-means clustering (KFCM) algorithm, and the intuitionistic fuzzy c-means clustering (IFCM) algorithm in segmenting five images. The experimental results verify the effectiveness and superiority of our proposed KIFCM algorithm.

Pattern Recognition and Image Analysis. 2019;29(4):592-597
pages 592-597 views

Mathematical Theory of Images and Signals Representing, Processing, Analysis, Recognition and Understanding

Descriptive Image Analysis: Part II. Descriptive Image Models

Gurevich I.B., Yashina V.V.

Abstract

The article is the second in a series on the current state and prospects of Descriptive Image Analysis, which is the leading branch of the modern mathematical theory of image analysis. Descriptive image analysis is a logically organized set of descriptive methods and models for analyzing and evaluating information in the form of images and for automating knowledge and data extraction from images necessary for making intelligent decisions about real-world scenes displayed and represented in an analyzed image. Problems on making intelligent decisions based on data analysis require formal representation of the source information, ideally, a mathematical model. Image modeling has a long, but not very productive history. Therefore, in the Descriptive Approach to image analysis and understanding (DA), the primary problem is bringing an image to a form suitable for recognition. The DA interprets the sought representation in the form of a descriptive image model (DIM). Due to the extremely complex informational nature and technical features involved in the digital representation of an image, it is impossible to construct a classical mathematical model of an image as an information object. To overcome this complexity and regularize the problem of bringing an image to a form convenient for recognition, a new mathematical object, a DIM is introduced and used in the DA. Models of recognition objects—images—and definitions of transformations over image models are considered. A formalized concept of descriptive image models is proposed. The results can be used to create a basis for methods of transforming and understanding an image as a mathematical object. The article’s main contribution to developing the mathematical theory of image analysis is understanding of an image as an information object and mathematical object.

Pattern Recognition and Image Analysis. 2019;29(4):598-612
pages 598-612 views

Research on Improvement of Stagewise Weak Orthogonal Matching Pursuit Algorithm

Pu L., Jiangtao Z., Kewen X., Qiao Z., Ziping H.

Abstract

One of the key technologies of compressed sensing is the signal reconstruction. And the two important indicators of signal reconstruction are the reconstruction probability and the time consumed. The Stagewise Weak Orthogonal Matching Pursuit (SWOMP) is widely used because the sparsity does not need to be a priori condition. The use of fixed threshold parameter in the iterative process can easily lead to overestimation and underestimation. Inspired by the idea of “the initial stage is approaching quickly and the final stage is approaching gradually,” that is, the search rule of “firstly fast and then slow,” an improved algorithm replacing the fixed threshold selection with S-shaped function value in each iteration is proposed to overcome the shortcoming that the fixed threshold parameter is selected in every iteration of SWOMP algorithm. Through compared experiment of six different S-shaped functions, the results show that the influence of different S-shaped functions on the SWOMP algorithm is different, and the improved SWOMP algorithm with the sixth S-shaped function has the best reconstruction effect.

Pattern Recognition and Image Analysis. 2019;29(4):613-620
pages 613-620 views

Generalized Spectral-Analytical Method and Its Applications in Image Analysis and Pattern Recognition Problems

Makhortykh S.A., Kulikova L.I., Pankratov A.N., Tetuev R.K.

Abstract

The generalized spectral-analytical method as a new approach to the processing of information arrays is stated. Some theoretical foundations of this method and its applications in different experimental data analysis problems are given. The method is based on the adaptive expansion of initial arrays in the functional bases belonging to the classical algebraic systems of polynomials and functions of continuous and discrete arguments (Jacobi, Chebyshev, Lagrange, Laguerre, Kravchuk, Charlier, and other polynomials). This approach combines analytical and digital data-processing procedures, thus providing a basis for the universal combined technology for the processing of information arrays. An appreciable part of this review is devoted to video data analysis and pattern-recognition problems. In addition, some relevant applications of this method in biomedical and bioinformation data analysis, recognition, classification, and diagnosis problems are described.

Pattern Recognition and Image Analysis. 2019;29(4):621-638
pages 621-638 views

A Comprehensive Review of Digital Data Hiding Techniques

Rasmi A., Arunkumar B., Anees V.M.

Abstract

This paper makes a study and overview of the currently existing data hiding schemes, from its primitive stages through the potential application. In the modern times with the advent of web technology information masking in digital imagery plays a main task of guaranteed protection and peer to peer data transfer between the intended users. Multitude schemes had predicted and employed on covering up secrecy from hackers. Initially availing signal concealment can be categorized into watermark, steganography cryptography, and so each has its unique benefits as well as demerits, fundamentally the insertion strategies have to follow certain features such as high capacity, robustness, security, payload and reliability. This paper gives a brief explanation of the multidisciplinary hiding strategies and their features in a comprised form using the tabular representation, such as watermarking, cryptography, and steganography.

Pattern Recognition and Image Analysis. 2019;29(4):639-646
pages 639-646 views

Artificial Intelligence Techniques in Pattern Recognition and Image Analysis

Estimation of the Closeness to a Semantic Pattern of a Topical Text without Construction of Periphrases

Mikhaylov D.V., Emelyanov G.M.

Abstract

The paper considers the problem of numerical estimation of the closeness of a topical text to the most rational linguistic variant (i.e. semantic pattern or sense standard) of the description of the knowledge fragment it represents without paraphrasing. This problem is relevant when implementing targeted selection of text information by the maximum of the useful semantic component with respect to the tasks solved by the user. Examples of practical applications may include selection of papers for scientific publishing and design of training courses and educational portals. In the suggested solution, the basis of the estimate of the closeness of the text to the semantic pattern is the splitting of the words of each of its phrases into classes by the TF-IDF metric value relative to texts of a corpus preformed by an expert. Abstracts of scientific papers together with their titles are analyzed. The suggested numerical estimate of closeness to the sense standard makes it possible to rank articles by the significance of the described fragments of knowledge regarding a given subject area and by non-redundancy of the description itself. Here, the semantic images of the texts closest to the semantic pattern specify the words with the highest TF-IDF values, which, when placed next to each other in the linear series of a phrase, are, most probably, semantically related and form key combinations with words whose mentioned metric is close to average. To classify word combinations as key ones, the interpretation of the TF-IDF metric, estimating the number of simultaneous occurrences of all words in the analyzed combination into phrases of the individual document, is introduced.

Pattern Recognition and Image Analysis. 2019;29(4):647-653
pages 647-653 views

Applied Artificial Intelligence Systems. Knowledge-Based and Intelligence Systems

On the Procedures of Generation of Numerical Features over Partitions of Sets of Objects in the Problem of Predicting Numerical Target Variables

Torshin I.Y., Rudakov K.V.

Abstract

Analysis of criteria for the solvability/regularity of problems and of the correctness of algorithms is applied here to the problem of prediction of the values of numerical variables. It is shown that partial regularity is a necessary and sufficient condition for the solvability of the corresponding system of the classification problems. Cross-validation experiments conducted on several datasets from the field of biomedicine (non-invasive diagnostics of magnesium concentration in blood plasma), bioinformatics (prediction of the protein secondary structure), and solid-state physics (prediction of the properties of high-temperature superconductors) have demonstrated the effectiveness of the developed methods for generating “synthetic” informative numerical features and for increasing the accuracy of prediction of the numerical target variables.

Pattern Recognition and Image Analysis. 2019;29(4):654-667
pages 654-667 views

Applied Problems

A Lightweight Network Based on Pyramid Residual Module for Human Pose Estimation

Gao B., Ma K., Bi H., Wang L.

Abstract

The human pose estimation is one of the most popular research fields. Its current accuracy is satisfactory in some cases, however, there exists a challenge for practical application due to the limited memory and computational efficiency in FPGAs and other hardware. We propose a lightweight module based on the pyramid residual module in this work. We change the convolution mode by using the depth-wise separable convolutions structure. Meanwhile, the channel split module and channel shuffle module are added to change the feature graph dimension. As a result, the parameters of the network are reduced effectively. We test the network on standard benchmarks MPII dataset, our method reduces about 50% of the training storage space while maintaining comparable accuracy. The complexity is simplified from 9 GFLOPs to 3 GFLOPs.

Pattern Recognition and Image Analysis. 2019;29(4):668-675
pages 668-675 views

Surface Classification of Damaged Concrete Using Deep Convolutional Neural Network

Hung P.D., Su N.T., Diep V.T.

Abstract

Concrete is known for its a strength and durability as a building material. It is heavily utilized in almost all infrastructures, from pipes, building structures to roads and dams. However, due to external factors or internal compositions, concrete can be damaged and hence affects the quality of the constructions. The type of damage that appeared on concrete is often the first a clue as to how it occurred. Therefore proper diagnosing of the problem can help engineers determine how quickly and how best to fix it. The application of information technology, especially artificial intelligence, to automatically classify the damage types can help tremendously in this aspect. There have been some studies in using computer vision to examine the surfaces of concrete for damages. This study attempts a more challenging task of classifying the five common types of concrete damage. A new dataset is built and the Convolutional Neural Network (CNN) architecture is used for classification. The results obtained have an accuracy of 95 and 93% on the training set and the test set respectively.

Pattern Recognition and Image Analysis. 2019;29(4):676-687
pages 676-687 views

An Effective Feature Descriptor with Gabor Filter and Uniform Local Binary Pattern Transcoding for Iris Recognition

Huo G., Guo H., Zhang Y., Zhang Q., Li W., Li B.

Abstract

Iris recognition is recognized as one of the most reliable and efficient technique for human identification in the biometric fields. The Gabor filter and local binary pattern (LBP) are widely adopted for feature extraction in face recognition. However, it is difficult to achieve high recognition accuracy when the Gabor filter or LBP is directly applied to iris texture representation. This paper presents an effective iris feature descriptor, which first uses 2D-Gabor filter to extract multi-orientation imaginary (MOI) feature, and then applies uniform LBP for region feature encoding. Thus, the MOI feature-by-point energy is converted into that of the uniform LBP histogram-by-block, during which the distributions of the intra- and inter-class are greatly widened. Such process largely improves distinguishability of MOI features. Finally, the Bhattacharyya distance is adopted for matching. Experimental results on CASIA and JLU iris image databases show that this method performs better for combining MOI features and LBP encoding as compared to their individual function.

Pattern Recognition and Image Analysis. 2019;29(4):688-694
pages 688-694 views

Application of Graphic and Image Technology in Strong Convective Weather Monitoring and Early-Warning System

Jiao F., Huang T.

Abstract

In order to improve the capability of monitoring and predicting strong convective weather of Zhaoqing area, monitoring and early-warning system of strong convective weather is developed. Graphic and image technology is applied in the development of the system. The region growing method is used in image segmentation. The map of all towns of Zhaoqing area is drawn by coordinate transformation and graphics technologies. The processed radar image and the map of Zhaoqing area have been overlapped. By using improved arc-length method, the system can judge which areas are affected by strong convective echo.

Pattern Recognition and Image Analysis. 2019;29(4):695-701
pages 695-701 views

An Efficient Human Activity Recognition Technique Based on Deep Learning

Khelalef A., Ababsa F., Benoudjit N.

Abstract

In this paper, we present a new deep learning-based human activity recognition technique. First, we track and extract human body from each frame of the video stream. Next, we abstract human silhouettes and use them to create binary space-time maps (BSTMs) which summarize human activity within a defined time interval. Finally, we use convolutional neural network (CNN) to extract features from BSTMs and classify the activities. To evaluate our approach, we carried out several tests using three public datasets: Weizmann, Keck Gesture and KTH Database. Experimental results show that our technique outperforms conventional state-of-the-art methods in term of recognition accuracy and provides comparable performance against recent deep learning techniques. It’s simple to implement, requires less computing power, and can be used for multi-subject activity recognition.

Pattern Recognition and Image Analysis. 2019;29(4):702-715
pages 702-715 views

Strong-Structural Convolution Neural Network for Semantic Segmentation

Ouyang Y.

Abstract

We present a combinatorial deep convolutional neural network architecture, termed strong convolution neural network (SSN), for semantic segmentation task. The structure of SSN consists of two components: Increment feature convolution neural network and post-process Conditional Random Fields unit (CRFs). The increment feature CNN unit has three parts: I-Block, Deconvolution layer and Transition Block. I-Block employs increment convolution to efficiently maintain feature information. Before passing through pooling layer, we put the feature map into activate layer ReLU, and batch normalization layer. In Decoding stage, we use skip-connects to keep the pooling index information. To enforce the correlation of same semantic labels, we define the strong semantic label (SSL) stage to intensify the pairwise potential energy. To achieve high computation performance, we make further improvement on SSL by employing the adaptive soft semantic sections label method. We proposed the adaptive strong semantic label selection algorithm to generate the SSL. Through the CRFs unit, with unitary energy and pairwise edge energy, the semantic segmentation initial labels transform semantic segmentation labels. Experimental evaluation reveals the training time versus accuracy trade-off involved in achieving good segmentation performance.

Pattern Recognition and Image Analysis. 2019;29(4):716-729
pages 716-729 views

Using the K-Nearest Neighbors Algorithm for Automated Detection of Myocardial Infarction by Electrocardiogram Data Entries

Savostin A.A., Ritter D.V., Savostina G.V.

Abstract

This article presents a new approach to solving the problem of automated detection of myocardial infarction of various localization by electrocardiogram data entries. Only the second standard lead is used in the analysis. The signal in this lead undergoes digital filtering in order to remove low-frequency and high-frequency interference. Then, individual cardio complexes P-QRS-T are extracted from the signal, and the following parameters are calculated for them: minimum value, maximum value, interquartile range, mean absolute deviation, root mean square, mode, and entropy. Using the calculated parameters, a standardized training (learning) dataset is formed. The classifier model represents the k-nearest neighbors algorithm with the Manhattan metric of the distance between the objects and number of neighbors k = 9. After learning, the classifier shows the results by precision pre = 98.60%, by recall rec = 97.34%, by specificity spec = 95.93%, and by accuracy acc = 97.03%. According to the analysis of the obtained results, the suggested classifier model offers certain advantages as compared to existing alternatives.

Pattern Recognition and Image Analysis. 2019;29(4):730-737
pages 730-737 views

Segmentation and Feature Extraction of Endoscopic Images for Making Diagnosis of Acute Appendicitis

Ye S., Nedzvedz A., Ye F., Ablameyko S.

Abstract

In recent years, digital endoscopy has established as key technology for medical screenings and minimally invasive surgery. Endoscopy image processing techniques have been applied to the diagnosis of diseases. In this paper, an effective approach is proposed to process endoscopic images to detect acute appendicitis. For this purpose, we first introduced image enhancement techniques that allow us to improve quality of endoscopic image for further processing. A simple and effective image segmentation technique was developed to detect vessels and vermiform appendix. The hierarchical set of features have been extracted for detecting acute appendicitis. It includes geometrical, colorimetric, densitometric, and topological features. For each appendicitis feature discriminant indexes have been introduced for diagnosis. This method has achieved good results in clinical application.

Pattern Recognition and Image Analysis. 2019;29(4):738-749
pages 738-749 views

Estimation of Blood Flow Velocity in Coronary Arteries Based on the Movement of Radiopaque Agent

Sokolov S.Y., Volchkov S.O., Bessonov I.S., Chestukhin V.V., Kurlyandskaya G.V., Blyakhman F.A.

Abstract

The paper discusses methodological techniques for increasing the diagnostic value of routine angiographic examinations of patients. It presents algorithms for digital processing of heart video images, which make it possible to quantitatively characterize hemodynamics in the coronary bed by determining the velocity of spread of a contrast agent through the arteries. The proposed approach includes several stages and procedures and takes account of the errors caused by the movement of the arteries due to the mechanical activity of the heart. The paper presents the results of estimating coronary blood-flow velocity in a patient with coronary heart disease, which are compared with computer simulation data. The sources of errors, ways to minimize them, and prospects for using the proposed methodology for effective angiographic diagnosis are discussed.

Pattern Recognition and Image Analysis. 2019;29(4):750-762
pages 750-762 views

A Method for Predicting Rare Events by Multidimensional Time Series with the Use of Collective Methods

Zhuravlev Y.I., Sen’ko O.V., Bondarenko N.N., Ryazanov V.V., Dokukin A.A., Vinogradov A.P.

Abstract

A method for predicting rare events by the preceding dynamics of features is considered. The method is analyzed on the example of the problem of predicting revocation of the license of a credit institution on the basis of the reporting indicators published at least six months before the regulator made the appropriate decision. The technology developed is based on the calculation of collective solutions by sets of recognition algorithms. Investigations have shown that the most effective prediction is obtained with the use of collective algorithms involving various types of decision forests and combinatorial and logical methods. The method developed also involves the procedure of ranking the indicators according to their information value, in which the collective ranking is calculated on the basis of information estimates obtained with the use of built-in procedures within individual recognition methods.

Pattern Recognition and Image Analysis. 2019;29(4):763-768
pages 763-768 views

Pattern Recognition and Image Analysis Milieu

pages 769-774 views

Erratum

Erratum to: Predictive Diagnosis of Glaucoma Based on Analysis of Focal Notching along the Neuro-Retinal Rim Using Machine Learning

Mukherjee R., Kundu S., Dutta K., Sen A., Majumdar S.

Abstract

erratum

Pattern Recognition and Image Analysis. 2019;29(4):775-775
pages 775-775 views

This website uses cookies

You consent to our cookies if you continue to use our website.

About Cookies