


Vol 29, No 3 (2019)
- Year: 2019
- Articles: 23
- URL: https://journals.rcsi.science/1054-6618/issue/view/12271
General Problems, Methodology and Principles of Pattern Recognition and Image Analysis
Mathematical Foundations for Processing High Data Volume, Machine Learning, and Artificial Intelligence
Abstract
A brief outline of a report made November 12, 2018 at the Scientific Session of the Mathematical Science Branch of the Russian Academy of Sciences is presented in the paper. The report is orientated to an audience far from the problems on data mining. The report is an overview and presents the author’s personal considerations on the examined scientific field.



Mathematical Method in Pattern Recognition
A Clustering Based Classification Approach Based on Modified Cuckoo Search Algorithm
Abstract
Cuckoo Search Algorithm (CSA) is one of the new swarm intelligence based optimization algorithms, which has shown an effective performance on many optimization problems. However, the effectiveness of CSA significantly depends on the exploration and exploitation potential and it may also possible to increase its efficiency when solving complex optimization problems. In this study, some mechanisms have been employed on CSA to increase its efficiency such as use of global best and individual best solutions to guide the other solutions, self-adaption techniques for parameters and so on. The modified CSA (i.e., MCSA) is successfully employed in clustering based classification domain. The experimental results and execution time prove its effectiveness over existing modified CSAs and other employed swarm intelligence algorithms. The proposed clustering model is also employed in color histopathological image segmentation domain and provides effective result.



Lightweight Nearest Convex Hull Classifier
Abstract
A new type of classifier, the lightweight nearest convex hull (LNCH) classifier, is proposed. It is called lightweight due to the simplicity of its algorithm. It is based on a new method for estimating the proximity of the test point to the convex hull of a class in the case when the test point intersects convex hulls of the classes. The concept of the penetration depth of a point into a convex hull is used. Proximity is determined based on the analysis of extreme points projected on the direction vector from this point to the centroid of the class. A decision rule for multiclass problems is derived for the LNCH classifier using a new method for estimating the proximity. The results of experimental studies on synthesized numerical data and on real data for breast cancer diagnosis are given. The results indicate higher recognition accuracy of the LNCH classifier compared to other types of classifiers.



On a Classification Method for a Large Number of Classes
Abstract
The construction of a two-level decision scheme for recognition problems with many classes is proposed that is based on the development of the error-correcting output codes (ЕСОС) method. In the “classical” ЕСОС, a large number of partitions of the original classes into two macroclasses are constructed. Each macroclass is a union of some original classes. Each macroclass is assigned either 0 or 1. As a result, each original class is defined by a row of 0 and 1 (the stage of encoding) and a coding matrix is constructed. The stage of classification of an arbitrary new object consists in the solution of each dichotomic problem and application of a special decision rule (the stage of decoding). In this paper, new methods for weighting and taking into account codewords, modifying decision rules, and searching for locally optimal dichotomies are proposed, and various quality criteria for classification and the cases of extension of a codeword are considered.



Representation, Processing, Analysis, and Understanding of Images
Adaptive Detection of Normal Mixture Signals with Pre-Estimated Gaussian Mixture Noise
Abstract
The paper describes the adaptive method of estimating the parameters of the distribution of the useful signal under the assumption that the noise distribution can be pre-estimated. It is based on the method of moving separation of the finite normal mixtures and implemented for the estimating both signal-noise and signal distribution parameters. We assume that the probability distribution of the signal, signal with noise and “pure” noise can be presented in form of finite normal mixtures. Also, a method for change point detection based on testing the homogeneity hypothesis using the Kolmogorov criterion is proposed.



Panoramic Image Mosaics via Distributed Systems Using Color Moments and Local Wavelet-Features
Abstract
In this paper, an efficient method based on color moments and local wavelet-features is proposed for panoramic image mosaics. Color moments are used to efficiently represent physical quantities of objects in an image. Wavelet can describe a wide variety of image characteristics and is a key component for image-related applications. Therefore, during the feature-extraction stage for panoramic image mosaics, we exploit color moments and wavelet-subband statistics to construct local feature vectors for image-patch representation. With a distributed system of local area network, the proposed mosaics system can achieve an average 295 FPS. Experimental results show that the local wavelet-features are able to produce plausible and satisfactory panoramic images.



Algebraic Interpretation of Image Analysis Operations
Abstract
The study is devoted to mathematical and functional/physical interpretation of image analysis and processing operations used as sets of operations (ring elements) in descriptive image algebras (DIA) with one ring. The main result is the determination and characterization of interpretation domains of DIA operations: image algebras that make it possible to operate with both the main image models and main models of transformation procedures that ensure effective synthesis and realization of the basic procedures involved in the formal description, processing, analysis, and recognition of images. The applicability of DIAs in practice is determined by the realizability—the possibility of interpretation—of its operations. Since DIAs represent an algebraic language for the mathematical description of image processing, analysis, and understanding procedures using image transformation operations and their representations and models, the authors consider an algebraic interpretation. These procedures are formulated and implemented in the form of descriptive algorithmic schemes (DAS), which are correct expressions of the DIA language. The latter are constructed from the processing and transformation of images and other mathematical operations included in the corresponding DIA ring. The mathematical and functional properties of DIA operations are of considerable interest for optimizing procedures of processing and analyzing images and constructing specialized DAS libraries. Since not all mathematical operations have a direct physical equivalent, the construction of an efficient DAS for image analysis involves the problem of interpreting operations for DAS content. Research into this problem leads to the selection and study of interpretation domains of DIA operations. The proposed method for studying the interpretability of DIA operations is based on the establishment of correspondence between the content description of the operation function and its mathematical realization. The main types of interpretability are defined and examples given of the interpretability/uninterpretability of operations of a standard image algebra, which is a restriction of the DIA with one ring.



Image Classification Model Using Visual Bag of Semantic Words
Abstract
In the image classification field, the visual bag of words (BoW) has two drawbacks. One is low classification accuracy because a visual BoW is typically extracted from local low-level visual feature vectors via key points, without considering the high-level semantics of an image. The other is excessive time consumption because the size of the vocabulary is very large, especially for images with explicit backgrounds and object content. To solve these two problems, we propose a novel image classification model based on a visual bag of semantic words (BoSW), which includes an automatic segmentation algorithm based on graph cuts to extract major semantic regions and a semantic annotation algorithm based on support vector machine to label the regions with a visual semantic vocabulary. The proposed BoSW model refines image semantics by introducing user conceptions for extracting semantic vocabularies and reducing the size of the vocabulary. Experimental results demonstrate the superiority of the proposed algorithm through comparisons with state-of-the-art methods on benchmark datasets.



Robust Visual Tracking Based on Relaxed Target Representation
Abstract
Developing an effective target appearance model is a challenging task in visual tracking under the influences of complicated appearance variations. Many tracking algorithms use a linear combination of previous tracking results to represent a target candidate. In existing target representations, all of the feature elements of a target candidate have the same coding vector. With such type of target representations, robust tracking is not satisfactory when drastic appearance variations occur. In this work, we present a novel appearance model for visual tracking. The proposed appearance model considers the similarity and the distinctiveness of the feature elements of a target candidate. The feature elements should share some similarity to jointly represent a target pattern. We exploit the distinctiveness of feature elements to represent the different importance by introducing a weighted regularization term in the appearance model. A more stable and discriminative target representation is obtained. Superior performance on challenging sequences against state-of-the-art trackers show the robustness of the novel appearance model and the proposed tracker.



The Stability and Noise Tolerance of Cartesian Zernike Moments Invariants
Abstract
In real applications, it is quite common that shapes may have changes in orientation, scale, and viewpoint; a shape retrieval method should be unaffected by translation, rotation, and scaling. Zernike moments are widely used in shape retrieval, due to its rotation invariance. However, Zernike moments are not directly invariant under scaling and translation. Recently, Cartesian Zernike Moments Invariants (CZMI) were introduced to make Zernike moments directly invariant under scaling and translation. Although CZMI reduce the scale errors considerably, they are inconsistent and the scale errors increase for high aspect ratio shapes. In this paper, we introduce a scale invariance parameter which reduces the scale errors, improves the stability of the scale invariance and is more robust for wide range of shapes; even if the shapes are corrupted by different kinds of noises, such as Gaussian, Salt & Pepper and Speckle noise, our combined scale invariance parameter still has good performances.



Applied Problems
A Noninvasive Computerized Technique to Detect Anemia Using Images of Eye Conjunctiva
Abstract
Anemia is the blood disorder which develops in the condition of lack of healthy red blood cells or hemoglobin. According to the World Health Organization (WHO) nearly quarter of the human population suffers from anemia moreover, invasive detection of anemia is tedious and expensive. Initial screening for noninvasive detection of anemia is done by examining the color of eye conjunctiva and after that by accommodating the outcomes with an intrusive blood test. This paper aims to resolve this issues along with providing an optimal and fast solution for detecting the anemia using noninvasive methods. This process includes capturing the image of eyes and then manually extracting the eye conjunctiva and obtaining the region of interest (ROI). Once ROI is extracted, these images are processed to obtain the mean intensity values of red and green components of image pixels corresponding to ROI. Then a tuned machine learning algorithm is used to predict whether the patient is anemic or not. The model employed is run over 99 test subjects using k-Fold cross-validation and had achieved an accuracy of 93 percent. This study aims to develop an automated and cost-effective noninvasive technique.



A Fast Action Recognition Strategy Based on Motion Trajectory Occurrences
Abstract
A few light stimuli coherently distributed in the space and time are the essential input that a visual system needs to perceive motion. Inspired in such fact, a compact motion descriptor is herein proposed to describe patterns of neighboring trajectories for human action recognition. The proposed method introduces a strategy that models the local distribution of neighboring points by defining a spatial point process around motion trajectories. Particularly, a two-level occurrence analysis is carried out to discover motion patterns that underlying on trajectory points representation. Firstly, local occurrence words are computed over a circular grid layout that is centered in a fixed position for each trajectory. Then, a regional occurrence description is achieved by representing actions as the most frequent local words that occur in a particular video. This second occurrence layer could be computed for the entire video or by each frame to achieve an online recognition. This compact descriptor, with local size of 72 and sequence descriptor size of 400, acquires importance in real-time applications and environments with hardware restrictions. The proposed strategy was evaluated on KTH and Weizmann dataset, achieving an average accuracy of 91.2 and 78%, respectively. Moreover, a further online recognition was performed over UT-Interaction achieving an accuracy of 67% by using only the first 25% of video sequences.



About methods of Synthesis Complete Regression Decision Trees
Abstract
Abstract—Algorithms for regression problem on the basis of complete decision trees are examined in the paper. The examined structure of the decision tree makes it possible to consider all features meeting the branching criterion in each special vertex of the tree. New algorithms for synthesizing complete regression decisions trees are presented. The developed algorithms are tested on real problems and it is revealed that the algorithms are efficient.



Detection and Removal of Foreground Objects in Spherical Images for the Synthesis of Photorealistic Intermediate Images
Abstract
A method is proposed for the removal of foreground objects from spherical images as applied to the synthesis of intermediate images. The synthesis procedure is based on a 3D model of the captured scene. As a rule, the objects that appear in the frame but are not fixed in the model are deformed during the synthesis of an intermediate frame, thus making the result unrealistic. The method proposed allows one to remove such objects from the foreground. The method is based on the comparison of two images of the scene taken from close viewpoints and on the redundancy of available information. The method was tested on panoramas and models available in the Google Street View service.



A Combination of Global and Local Features for Brain White Matter Lesion Classification
Abstract
In the present time, the development of medical images and the increasing use of digital images have played a crucial role in medical diagnosis. In this sense, the rapid growth of magnetic resonance images (MRI) technology increased the necessity to store, analyze and describe this amount of information more efficiently. By improving these processes, decision making for clinicians may reach a more accurate and a well-informed diagnosis. To achieve this aim, machine learning approaches have been considered as a complementary pillar of image processing. It has become essential to find efficient descriptors, which characterize the images such as texture, shape and color. Although there are many representations of known features, they are difficult to use in their unchanged forms. The proposed approach combines the global and local features. Global features are extracted using the combination of magnitude and phase features of the descriptor angular radial transform (ART). Local features are obtained through local binary pattern (LBP). Four most known machine-learning approaches are then applied (SVM, ANN, NB, KNN) on both features and the combination of both techniques yielded the best tumor detection performance.



Construction of a Class of Logistic Chaotic Measurement Matrices for Compressed Sensing
Abstract
The construction of the measurement matrix is the key technology for accurate recovery of compressed sensing. In this paper, we demonstrated correlation properties of nonpiecewise and piecewise logistic chaos system to follow Gaussian distribution. The correlation properties can generate a class of logistic chaotic measurement matrices with simple structure, easy hardware implementation and ideal measurement efficiency. Specifically, spread spectrum sequences generated by the correlation properties follow Gaussian distribution. Thus, the proposed algorithm constructs chaos-Gaussian matrices by the sequences. Simulation results of one-dimensional signals and two-dimensional images show that chaos-Gaussian measurement matrices can provide comparable performance against common random measurement matrices. In addition, chaos-Gaussian matrices are deterministic measurement matrices.



A Computational Approach to Pertinent Feature Extraction for Diagnosis of Melanoma Skin Lesion
Abstract
Melanoma, starts growing in melanocytes, is less common but more serious and aggressive than any other types of skin cancers found in human. Melanoma skin cancer can be completely curable if it is diagnosed and treated in an early stage. Biopsy is a confirmation test of melanoma skin cancer which is invasive, time consuming, costly and painful. To prevent this problem, research regarding computerized analysis of skin cancer from dermoscopy images has become increasingly popular for last few years. In this research, we extract the pertinent features from dermoscopy images related to shape, size and color properties based on ABCD rule. Although ABCD features were used before, these features were mostly calculated to reflect asymmetry, compactness index as border irregularity, color variegation and average diameter. This paper proposes one asymmetry feature, three border irregularity features, one color feature and two diameter features as distinctive and pertinent. Implementation of our approach indicates that each of these proposed features is able to detect melanoma lesions with over 72% accuracy individually and the overall diagnostic system achieves 98% classification accuracy with 97.5% sensitivity and 98.75% specificity. Therefore, this method could assist dermatologist for making decision clinically.



Vision Based Approach for Adaptive Parking Lots Occupancy Estimation
Abstract
In the large cities, it remains difficult and expensive to create more parking spaces for vehicles since they have almost reached their full occupancy. The lack of available parking places leads to the problem of traffic congestion and consequently to pollution, since drivers will spend a lot of time looking for a vacant parking place. Hence the need to establish a parking management system able to improve the exploitation of the existent parking places at a city level. This paper proposes a new multi agent system for parking vacancies detection based on vision techniques. We explore a network of interconnected video surveillance cameras and parking stations to provide an intelligent service to the drivers in order to facilitate their task and improve the exploitation of the parking resources. The proposed system introduces a new approach for parking spaces modeling and elaborates an adaptive approach for vacancies estimation in order to supply the driver with reliable information about the vacant parking spaces in the city according to the size of his vehicle.



Predictive Diagnosis of Glaucoma Based on Analysis of Focal Notching along the Neuro-Retinal Rim Using Machine Learning
Abstract
Automatic evaluation of the retinal fundus image is regarded as one of the most important future tools for early detection and treatment of progressive eye diseases like glaucoma. Glaucoma leads to progressive degeneration of vision which is characterized by shape deformation of the optic cup associated with focal notching, wherein the degeneration of the blood vessels results in the formation of a notch along the neuroretinal rim. In this study, we have developed a methodology for automated prediction of glaucoma based on feature analysis of the focal notching along the neuroretinal rim and cup to disc ratio values. This procedure has three phases: the first phase segments the optic disc and cup by suppressing the blood vessels with dynamic thresholding; the second phase computes the neuroretinal rim width to detect the presence and direction of notching by the conventional ISNT rule apart from calculating the cup-to-disc ratio from the color fundus image (CFI); the third phase uses linear support vector based machine learning algorithm by integrating extracted parameters as features for classification of CFIs into glaucomatous or normal. The algorithm outputs have been evaluated on a freely available database of 101 images, each marked with decision of five glaucoma expert ophthalmologists, thereby returning an accuracy rate of 87.128%.



An Automatic Detection of Blood Vessel in Retinal Images Using Convolution Neural Network for Diabetic Retinopathy Detection
Abstract
Diabetes is a typical chronic disease that may remind to numerous complications. Since the diabetic patients, the diabetic retinopathy (DR) is standout amongst the most serious of these inconveniences and also most steady reasons of vision loss. Automatic detection of diabetic retinopathy at early stage is helping the ophthalmologist to treat the affected patient and avoid vision loss. Therefore, in this paper, we develop an efficient automatic diabetic detection in retinal images using convolution neural network. The suggested system mainly comprises of five modules such as (i) preprocessing, (ii) blood vessel segmentation, (iii) exudates segmentation, (iv) texture feature extraction, and (v) diabetic detection. At first, the preprocessing step is carried out using adaptive histogram equalization (AHE) for enhancing the input retinal image. Consequently, blood vessel segmentation and exudates segmentation are done using convolution neural network (CNN) and fuzzy c-means clustering (FCM) respectively. Then, texture features are extracted from blood vessel and exudates. After the feature extraction, the diabetic classification is done with the help of support vector machine. The experimental results demonstrate that the proposed approach accomplishes better diabetic detection result (accuracy, sensitivity, and specificity) compared to other approaches.



Detection and Restoration of Image from Multi-Color Fence Occlusions
Abstract
In Image De-fencing, segmentation and restoration of occluded fence region from the images are very challenging, when the desired object is occluded with an undesired object. This undesired object may be scattered in the whole image region. Existing algorithms can only detect single colored fences at a time from the digital images. This paper presents a multi-colored fence detection algorithm. Multi-threshold segmentation technique is used to segment the fence in the image. The segmented mask is amended by using morphological operations. To restore the fence occluded area in an image hybrid inpainting technique is used. Obtained results after experimentation are compared with the start-of-art image de-fencing technique.



Application of Particle Filter Algorithm Based on Gaussian Clustering in Dynamic Target Tracking
Abstract
In order to solve the problem of particle depletion and complexity in traditional particle filtering, a particle filter target tracking method based on Gaussian clustering is proposed in this paper. Combined with adaptive double-sampling and gradmethod, the real time and robustness of dynamic tracking are improved significantly.



Application of Quantum-Clustering on Thermograms of WiFi Circuits in Different Operation Modes
Abstract
The purpose of this work is to evaluate the efficacy of applying a model-based quantum clustering (QC) algorithm on thermograms of functional modes in WiFi circuits. As unsupervised clustering algorithm, it can work on clusters of any shape and does not require any prior information. QC proves its efficacy for many applications, it has been tested, in this work, and compared with other algorithms which suffer randomness according to initialization. The tests are conducted on thermograms of an electronic chip in different operation modes. The benefits of QC are confirmed through performance analysis of clustering algorithms. Robustness analysis is also conducted against white-Gaussian noise clustering and so on classification of actual WiFi circuit operation modes based on thermograms.


