Extraction of Data Features for Neuro-Classifier Input

封面

如何引用文章

全文:

详细

The problem of essential data compression to be input to ANN-classifier without loosing significant information is considered on the example of the quite substantial task of the genetic protein structure analysis, which is important for genetic biology researches in radiobiology and, especially, in agricultural. Such analysis is usually carried out by studying ElectroPhoretic Spectra (EPS) of gliadin (alcohol soluble protein) of the inspected grain cultivar. EPS digitization produces a densitogram with 4 thousands counts, which most informative features must be extracted to be input to ANN. Besides these data require special preprocessing for densitogram smoothing, pedestal eliminating, as well as compensating such digitization orocess defects as signal noise, variability of spectrum borders and illumination, their non-linear starches due to electrophoresis nonstationarity.
Several alternative approaches to features extracting were studied: (1) the densitogram coarsing into 200 averaged measurements; (2) the principal component analysis; (3) recognition of all well-pronounced peaks in order to evaluate their parameters to be input to ANN; (4)-(5) data compression by both discrete Fourier (DFT) and wavelet (DWT) transformations. These methods have been used for feature extraction from samples formed by experts for 30 different sorts. Then extracted features were used to train ANN of three-layer perceptron type. The comparative study of the recognition efficiency with data compressed by the methods listed above shows their high sensitivity to the number of sorts to be classified. Only DFT and DWT approaches could keep the efficiency on the level 95-97% up to 20 sorts.
A further development of feature extraction methods and a study of possibility to develop a hierarchy of classifying ANNs are intended.

作者简介

G Ososkov

Joint Institute for Nuclear Research

Email: ososkov@jinr.ru
Лаборатория информационных технологий; Объединённый институт ядерных исследований; Joint Institute for Nuclear Research

D Baranov

Joint Institute for Nuclear Research

Лаборатория информационных технологий; Объединённый институт ядерных исследований; Joint Institute for Nuclear Research

补充文件

附件文件
动作
1. JATS XML