Acesso aberto Acesso aberto  Acesso é fechado Acesso está concedido  Acesso é fechado Somente assinantes

Volume 26, Nº 2 (2016)

Mathematical Method in Pattern Recognition

Transformation of feature space based on Fisher’s linear discriminant

Nemirko A.

Resumo

Linear transformation of data in multidimensional feature space based on Fisher’s criterion is considered. The case of two classes with arbitrary distributions is studied. We derived expressions for recurrent calculation of weight vectors which form new features. Example offered shows that the newly found features which represent the data more accurately make it possible to achieve linear separability of classes which remains impossible using the technique of principal components and the classic Fisher’s linear discriminant.

Pattern Recognition and Image Analysis. 2016;26(2):257-261
pages 257-261 views

Optimisation of multiclass supervised classification based on using output codes with error-correcting

Ryazanov V.

Resumo

An approach of solving the problem of multiclass supervised classification, based on using errorcorrecting codes is considered. The main problem here is the creation of binary code matrix, which provides high classification accuracy. Binary classifiers must be distinct and accurate. In this issue, there are many questions. What should be the elements of the matrix, how many elements provide the best accuracy and how to find them? In this paper an approach to solve some optimization problems for the construction of the binary code matrix is considered. The problem of finding the best binary classifiers (columns of matrix) is formulated as a discrete optimization problem. For some partial precedent classification approach, there is a calculation of the effective values of optimising function. Prospects of this approach are confirmed by a series of experiments on various practical tasks.

Pattern Recognition and Image Analysis. 2016;26(2):262-265
pages 262-265 views

Method of weak classifiers fuzzy boosting: Iterative learning of quasi-linear algorithmic composition

Samorodov A.

Resumo

Method of fuzzy boosting providing iterative weak classifiers selection and their quasi-linear composition construction is presented. The method is based on the combination of boosting and fuzzy integrating techniques, when at each step of boosting weak classifiers are combined by Choquet fuzzy integral. In the proposed FuzzyBoost algorithm 2-additive fuzzy measures were used, and method for their estimation was proposed. Although detailed theoretical verification of proposed algorithm is still absent, the experimental results, made on simulated data models, demonstrate that in the case of complex decision boundaries FuzzyBoost significantly outperforms AdaBoost.

Pattern Recognition and Image Analysis. 2016;26(2):266-273
pages 266-273 views

On metric spaces arising during formalization of recognition and classification problems. Part 1: Properties of compactness

Torshin I., Rudakov K.

Resumo

In the context of the algebraic approach to recognition of Yu.I. Zhuravlev’s scientific school, metric analysis of feature descriptions is necessary to obtain adequate formulations for poorly formalized recognition/classification problems. Formalization of recognition problems is a cross-disciplinary issue between supervised machine learning and unsupervised machine learning. This work presents the results of the analysis of compact metric spaces arising during the formalization of recognition problems. Necessary and sufficient conditions of compactness of metric spaces over lattices of the sets of feature descriptions are analyzed, and approaches to the completion of the discrete metric spaces (completion by lattice expansion or completion by variation of estimate) are formulated. It is shown that the analysis of compactness of metric spaces may lead to some heuristic cluster criteria commonly used in cluster analysis. During the analysis of the properties of compactness, a key concept of a ρ-network arises as a subset of points that allows one to estimate an arbitrary distance in an arbitrary metric configuration. The analysis of compactness properties and the conceptual apparatus introduced (ρ-networks, their quality functionals, the metric range condition, i- and ρ-spectra, ε-neighborhood in a metric cone, ε-isomorphism of complete weighted graphs, etc.) allow one to apply the methods of functional analysis, probability theory, metric geometry, and graph theory to the analysis of poorly formalized problems of recognition and classification.

Pattern Recognition and Image Analysis. 2016;26(2):274-284
pages 274-284 views

A method of total variation to remove the mixed Poisson-Gaussian noise

Thanh D., Dvoenko S.

Resumo

There are many modern devices are used to create digital images. These devices use optical effects to create images. Therefore, the image quality depends on quality of optical sensors. Because of the limits of technology, these sensors cannot reconstruct the images perfectly, and always include some defects. One from these defects is noise. The noise reduces image quality and result of image processing. The image noises can be classified into some types: Gaussian noise, Poisson noise, speckle noise and so on. Depending on particular noises, we have efficient methods to remove them. There is no existing a universal method to remove all noises effectively. In this paper, we proposed a method to remove a noise that is popular in biomedicine. This noise can be considered as a combination of Gaussian and Poisson noises. Our method is based on the total variation of an image intensity (brightness) function.

Pattern Recognition and Image Analysis. 2016;26(2):285-293
pages 285-293 views

Representation, Processing, Analysis and Understanding of Images

A progressive framework for dense stereo matching

Jia B., Liu S., Du Z.

Resumo

A progressive framework is proposed for dense stereo matching to solve problems caused by weaktexture and occlusion in this paper. The main idea is that disparity is extracted progressively, from coarse to fine, from sparse to dense. First, a coarse disparity map is obtained by the segment-based pre-matching method, in which horizontal and vertical segment matching are performed in parallel and pre-matching results are merged to preserve more details. Second, disparity diffusion is performed to roughly estimate disparity values for miss-matched points. Third, a probabilistic approach is used for disparity refinement, taking into account stereo prior, image likehood and disparity smoothness. Experiments are made on the Middlebury benchmark to demostrate the effectiveness of the proposed algorithm.

Pattern Recognition and Image Analysis. 2016;26(2):294-301
pages 294-301 views

Rotation, scaling and translation invariant texture recognition by Bessel-Fourier moments

Xiao B., Lu G., Zhao T., Xie L.

Resumo

The ideal of Bessel-Fourier moments (BFMs) for image analysis and only rotation invariant image cognition has been proposed recently. In this paper, we extend the previous work and propose a new method for rotation, scaling and translation (RST) invariant texture recognition using Bessel-Fourier moments. Compared with the others moments based methods, the radial polynomials of Bessel-Fourier moments have more zeros and these zeros are more evenly distributed. It makes Bessel-Fourier moments more suitable for invariant texture recognition as a generalization of orthogonal complex moments. In the experiment part, we got three testing sets of 16, 24 and 54 texture images by way of translating, rotating and scaling them separately. The correct classification percentages (CCPs) are compared with that of orthogonal Fourier-Mellin moments and Zernike moments based methods in both noise-free and noisy condition. Experimental results validate the conclusion of theoretical derivation: BFM performs better in recognition capability and noise robustness in terms of RST texture recognition under both noise-free and noisy condition when compared with orthogonal Fourier-Mellin moments and Zernike moments based methods.

Pattern Recognition and Image Analysis. 2016;26(2):302-308
pages 302-308 views

Adaptivity of conditional random field based outdoor point cloud classification

Lang D., Friedmann S., Paulus D.

Resumo

In this paper we present how adaptable learned models of graphical models are and how they can be used for classification tasks of 3D laser point clouds with different distributions and density. In order to model the contextual information we use a pair-wise conditional random field and an adaptive graph down-sampling method based on voxel grids. As feature we apply the rotation invariant histogram-of-oriented-residuals operator to describe the local point cloud distribution. We validate the approach with data collected from different laser range finders with varying point cloud distribution and density. Our experiments imply, that conditional random field models learned from one dataset can be applied to another dataset without a significant loss of precision.

Pattern Recognition and Image Analysis. 2016;26(2):309-315
pages 309-315 views

A semantic hybrid approach based on grouping adjacent regions and a combination of multiple descriptors and classifiers for automatic image annotation

Oujaoura M., Minaoui B., Fakir M.

Resumo

A large percentage of photos on the Internet cannot be reached by search engines because of the semantic gap due to the absence of textual meta-data. Despite of decades of research, neither model based approaches can provide quality annotation to images. Many segmentation algorithms use a low-level predicates to control the homogeneity of the regions. So, the resulting regions are not always being semantically compact. The first proposed approach to resolve this problem is to regroup the adjacent region of image. Many features extraction method and classifiers are also used singly, with modest results, for automatic image annotation. The second proposed approach is to select and combine together some efficient descriptors and classifiers. This document provides a hybrid semantic annotation system that combines both approaches in hopes of increasing the accuracy of the resulting annotations. The color histograms, Texture, GIST and invariant moments, used as features extraction methods, are combined together with multi-class support vector machine, Bayesian networks, Neural networks and nearest neighbor classifiers, in order to annotate the image content with the appropriate keywords. The accuracy of the proposed approach is supported by the good experimental results obtained from two image databases (ETH-80 and coil-100 databases).

Pattern Recognition and Image Analysis. 2016;26(2):316-335
pages 316-335 views

An automatic initialization of interactive segmentation methods using shortest path basins

Ryba T., Zelezny M.

Resumo

Image segmentation is one of many fundamental problems in computer vision. The need to divide an image to a number of classes is often a part of a system that uses image processing methods. Therefore, lots of methods were developed that are based on different approaches. The image segmentation could be classified with respect to many criteria. One such a criterion is based on the degree of allowed interactivity. The interactivity could be of several types—interactive initialization, interaction while the computation is running or manual refinement of achieved results, for example. Especially the precise initialization plays an important role in many methods. Therefore the possibility to initialize the method manually is often invaluable advantage and information obtained this way could be the difference between good and poor results. Unfortunately, in many cases it is not possible to initialize a method manually and the process needs to be automated. In this paper, an approach for such an automation is presented. It is based on shortest paths in a graph and deriving an area of influence for each obtained seed point. This method is called shortest path basins.

Pattern Recognition and Image Analysis. 2016;26(2):336-342
pages 336-342 views

Least-squares fitting of polygons

Sinnreich J.

Resumo

Fitting a polygon to a set of given points in the plane is a problem which may arise in certain engineering, computer graphics or scientific applications. This paper presents an algorithm which computes a continuous function closely approximating various polygons, for which the sum of the squares of the distance to the given set of points is minimized.

Pattern Recognition and Image Analysis. 2016;26(2):343-349
pages 343-349 views

Edge feature based approach for object recognition

Lu T., Peng L., Zhang Y.

Resumo

We address the problem of recognizing the object with distinctive edge features. For this purpose, a recognition approach based on local edge features is presented. First the edge features are detected in each image, and then its descriptor is computed to find the match features. Each match will give a vote with location, scale and orientation of the object. The recognition result can be found in the densest position in the vote space. Experimental results show that the presented method is robust and effective to the object with distinctive edge features.

Pattern Recognition and Image Analysis. 2016;26(2):350-353
pages 350-353 views

Evaluation of established line segment distance functions

Wirtz S., Paulus D.

Resumo

In this paper we present an evaluation of six well established line segment distance functions within the scope of line segment matching. We show analytically, using synthetic data, the properties of the distance functions with respect to rotation, translation, and scaling. The evaluation points out the main characteristics of the distance functions. In addition, we demonstrate the practical relevance of line segment matching and introduce a new distance function.

Pattern Recognition and Image Analysis. 2016;26(2):354-359
pages 354-359 views

Software and Hardware for Pattern Recognition and Image Analysis

The Chongqing University ChineSe Ear Video Database and its application

Liu J., Luo F., Huang H., Li L.

Resumo

Presently there already existed a few human ear image databases, which have been very influential in advancing the research on ear recognition. However no ear video databases are available. In this paper, we introduce the construction and basic content of the Chinese Ear Video Database (CEVD) and some primary evaluation results on it. The CEVD consists of 3600 ear video segments collected from 120 subjects. All the video segments in the database were collected in specially designed environment with three principal variations of illumination condition, viewing angle and interference. Compared with other public ear database, CEVD can not only be used for image-based applications but also video-based applications. In this paper we introduce the database and describe the collecting procedure. It excels in its large-scale and variation modes and is expected to have positive impact on the development and evaluation of ear recognition algorithms. This paper also gives experiment results of improved CamShift and AdaBoost algorithm on the database.

Pattern Recognition and Image Analysis. 2016;26(2):360-367
pages 360-367 views

Applied Problems

Real-time hand detection using continuous skeletons

Chernyshov V., Mestetskiy L.

Resumo

In this paper, a fast and reliable method for hand detection based on continuous skeletons approach is presented. It demonstrates real-time working speed and high detection accuracy (3–5% both FAR and FRR) on a large dataset (50 persons, 80 videos, 2322 frames). These make it suitable for use as a part of modern hand identification systems including mobile ones. Overall, the study shows that continuous skeletons approach can be used as prior for object and background color models in segmentation methods with supervised learning (e.g., interactive segmentation with seeds or abounding box).

Pattern Recognition and Image Analysis. 2016;26(2):368-373
pages 368-373 views

Reduction based similarity learning for high dimensional problems

Iofina G., Maximov Y.

Resumo

The problems of learning a good similarity function between objects naturally arise in machine learning, pattern recognition and data mining such as clustering, community detection or metric learning as well. We focus on the special case of this problem, where similarity function is completely determined by the hidden object classes. But we assume that no information about object labels is accessible on a training stage. The main contribution of the paper is two-stage algorithm assigns to each object its class label and provides a similarity function based on this assignment. We provide risk bounds and empirical evaluation in support of our algorithm. As a consequence of our analysis we provide a new tradeoff between empirical error of a multi-class classifier and its generalization error.

Pattern Recognition and Image Analysis. 2016;26(2):374-378
pages 374-378 views

On the false rejection ratio of face recognition based on automatic detected feature points

Ohzeki K., Takatsuka M., Kajihara M., Hirakawa Y., Sato K.

Resumo

The authors propose a new face recognition system with an evaluation function using feature points. The feature points are detected automatically by Milborrow’s Stasm software. Before recognition, rotation compensation and size normalization are applied to the feature points. The main method is to calculate the squared error between the registered face and the input face as to length of a characteristic pair of feature points on face. The False Rejection Rate (FRR) for the registered and input face of the same person, and the False Acceptance Rate (FAR) for the registered face and a different person’s input face are evaluated. The input is a video sequence. Stable recognition is obtained with small FRR and FAR for the video of a period of 0.5 s.

Pattern Recognition and Image Analysis. 2016;26(2):379-384
pages 379-384 views

An algorithm for recognizing linear objects in aerial photos automatically

Levashov A.

Resumo

A new algorithm for detecting linear infrastructural objects in aerial photos is presented. It is assumed that these objects pass through the whole image: beginning at one side and finishing at the opposite one. It is also assumed that the altitude of shooting and the image scale are invariable. The presented algorithm synthesizes the operation of an edge detector, a ridge detector, and the Hough accumulator into an object-of-interest mask excluding lots of spurious responses, and it completes missing information missed by detectors. First of all the image is preprocessed and anisotropically and repeatedly shrunk along the direction of the linear object and synthesis is performed by finding the shortest paths in a graph. The graph is presented in the form of a mesh, where each mesh node corresponds to a pixel of the shrunk image. At each node on the edges and in ridge lines, its energy is calculated, which is the reliability of this pixel. Then, the path that maximizes the sum of energies at the nodes is determined by considering its curvature. The obtained paths form a mask of linear objects. The algorithm is verified by using aerial photos for different seasons, and it demonstrates proper results (accuracy is ~80%).

Pattern Recognition and Image Analysis. 2016;26(2):385-397
pages 385-397 views

Location of pupil contour by Hough transform of connectivity components

Matveev I., Chinaev N., Novik V.

Resumo

A method for determining the pupil boundary in the image of eye is proposed. The method is based on image binarization followed by a search of the pupil as one of the connectivity components. The pupil boundary is determined as a part of boundary of the connectivity component. Hough transform is used for separating pupil in the case of its merging in one connectivity component with other objects, as well as to verify the likelihood of solution.

Pattern Recognition and Image Analysis. 2016;26(2):398-405
pages 398-405 views

Methods of analysis of geophysical data during increased solar activity

Mandrikova O., Polozov Y., Solovev I., Fetisova (Glushkova) N., Zalyaev T., Kupriyanov M., Dmitriev A.

Resumo

This work is directed at creation of methods of study of the processes in the ionospheric–magnetospheric system during increased solar and geomagnetic activity. Method of modeling and analysis of the parameters of the ionosphere, which allows prediction of the data and identification of the anomalies during the ionospheric disturbances, are given. Computational solutions for determination and estimation of the geomagnetic disturbances are described. Method of determination of the anomalous changes in the time course of cosmic rays, which allows qualitative estimations of the moments of their origination, duration, and intensity, is suggested.

On the basis of the methods elaborated, the data on the periods of strong and moderate magnetic storms are complexly analyzed. Sharp oscillations in the electron density of the ionosphere with positive and negative phases, which originate in the regions analyzed during an increase in geomagnetic activity, are distinguished. Positive phases of the ionospheric disturbances from several hours to one and a half days long were formed before the beginning of the magnetic storms. At the moments of the increase in the electron concentration, a local increase is observed in the level of cosmic rays (several hours before the magnetic storms) that supported the solar nature of these effects. During the strongest geomagnetic disturbances, the electron concentration in the ionosphere decreased significantly and led to prolonged negative phases of ionospheric storms, which coincided with the decrease in the level of cosmic rays (a Forbush decrease).

Pattern Recognition and Image Analysis. 2016;26(2):406-418
pages 406-418 views

A stochastic approach for association rule extraction

Oliinyk A., Subbotin S.

Resumo

This paper addresses the problem of association rule extraction. To extract quantitative association rules from given sets of observations, a stochastic method is proposed. The developed method improves the reliability and interpretability of recognition models based on association rules, employs the stochastic approach to search through various combinations of sets of elements in association rules, and uses a priori information about the informativity of intervals of feature values. A system of criteria for estimating association rules is developed that can be used to automate the analysis of properties and to compare various models based on association rules when solving pattern recognition problems.

Pattern Recognition and Image Analysis. 2016;26(2):419-426
pages 419-426 views

A two-phase solution procedure using mixtures of algorithms in the structure–property problem

Prokhorov E., Svitan’ko I., Zakharenko A., Sukhanova M., Bekker A., Perevoznikov A., Kumskov M.

Resumo

Prediction of the properties of chemical compounds by mathematical methods of pattern recognition is considered. The investigation was carried out by the example of the activity of cell division enzyme inhibitors. An approach based on mixtures of algorithms is used as the method for the construction of recognition models. A two-phase solution procedure for the structure–property problem is analyzed. The local classifier based on the nearest neighbor algorithm and the method of clustering sets is also described. New algorithms for the construction of classifier mixtures are compared. The methods of coordinated prediction of the activity of new compounds are examined. A comparison of mathematical modeling results with molecular design methods based on the coordination of compounds with known structures of therapeutic targets is also presented. An experimental study of the biological activity is conducted.

Pattern Recognition and Image Analysis. 2016;26(2):427-433
pages 427-433 views

Indian sign language recognition using SVM

Raheja J., Mishra A., Chaudhary A.

Resumo

Needs and new technologies always inspire people to make new ways to interact with machines. This interaction can be for a specific purpose or a framework which can be applied to many applications. Sign language recognition is a very important area where an easiness in interaction with human or machine will help a lot of people. At this time, India has 2.8M people who can’t speak or can’t hear properly. This paper targets Indian sign recognition area based on dynamic hand gesture recognition techniques in real-time scenario. The captured video was converted to HSV color space for pre-processing and then segmentation was done based on skin pixels. Also Depth information was used in parallel to get more accurate results. Hu-Moments and motion trajectory were extracted from the image frames and the classification of gestures was done by Support Vector Machine. The system was tested with webcam as well as with MS Kinect. This type of system would be helpful in teaching and communication of hearing impaired persons.

Pattern Recognition and Image Analysis. 2016;26(2):434-441
pages 434-441 views

A practical aspect of identification and classifying of Guns based on gunshot wound patterns using gene expression programming

Savakar D., Kannur A.

Resumo

This paper describes a practical aspect of identification and classifying of Guns based on gunshot wound patterns. We mark a genuinely digitized approach for the characteristic and set of guns used in homicidal cases using Gene expression programming. This approach develops a computationally attractive and effective alternative to investigate the guns used in crime which uses the images of gunshot wound patterns available on the human body. The experimental results achieved for identification and classification accuracy of 91.1 and 93.4%, respectively, on the available database of 30 images including three categories: Hard-contact, Loose-contact and Angled-contact of each pattern consisting of gunshot wounds. Our experimental results from the authentication experiments and false positive identification verses false negative identification also suggest the superiority of the proposed approach over the other popular feature extraction approach considered in this work.

Pattern Recognition and Image Analysis. 2016;26(2):442-449
pages 442-449 views

Speaker recognition regardless of context and language on a fixed set of competitors

Sorokin V., Leonov A., Trunov V.

Resumo

The problem of speaker recognition from a given set of speakers for any language and any context is considered. A database of Russian numerals that contains speech segments from 216 men and 177 women, each of whom spoke from 400 to 800 words, is used for recognition. Speech has been recorded on different types of microphones in different rooms at the natural noise level. Recognition is based on solutions of the inverse problem of finding the voice excitation pulse shape for each pitch period by the known speech segment. The pulse shape is defined as the inverse Fourier transform of the regularized ratio of speech signal spectra at the intervals of the open and closed glottis. Recognition is carried out by ten parameters: the pitch period, the open glottis interval duration, times when the source amplitude is maximum, minimum, or zero, the amplitude ratio for the minimum and maximum source pulses, three decomposition ratios of the source function by the principal component method, and the vowel duration. In such a recognition procedure, in the case of the utterance of a word that contains one vowel, the false reject rate (FRR) for men is 1.7–5.4%, and the false acceptance rate (FAR) is 5.4–7.1%. For women FRR = 2–5.2% and FAR = 5.2–6.3%. The recognition error decreases with an increasing number of vowels in the speech signal. At 10 vowels, for men FRR = 0.05–0.2% and FAR = 0.07–0.8%, and for women FRR = 0.09–0.2% and FAR = 0.17–2.1%.

Pattern Recognition and Image Analysis. 2016;26(2):450-459
pages 450-459 views