Mathematical model of the integral classification criterion and the algorithm for forming its graphical representation
- 作者: Bozhday A.S.1, Gorshenin L.N.1
-
隶属关系:
- Penza State University
- 期: 编号 3 (2025)
- 页面: 5-16
- 栏目: COMPUTER SCIENCE, COMPUTER ENGINEERING AND CONTROL
- URL: https://journals.rcsi.science/2072-3059/article/view/355011
- DOI: https://doi.org/10.21685/2072-3059-2025-3-1
- ID: 355011
如何引用文章
全文:
详细
Background. The most important method of scientific research is classification. The task of classifying objects with a heterogeneous space of information features, which currently has no unified solution, is particularly difficult. It is important to take into account the dependence of classification results on chosen point of view on subject area – a criterion that determines composition and typology of significant features of objects under consideration. The purpose of the work is the mathematical and algorithmic formalization of the process of developing classification criteria, as well as development of a unified format for presenting information features of an object regardless of their type, composition and completeness. This will reduce classification task to a single automated process - well-managed, visual, amenable to machine learning and invariant to specifics of subject area. Materials and methods. The work reveals the main problems of traditional approaches to solving the classification problem, in context of dynamic points of view on subject area. Existing works devoted to formalization of classification criteria and problems of choosing significant information features of objects are considered. The methodological basis of the work is set theory, machine learning methods, neural networks and raster computer graphics. Results. As a result, a concept and formalized set-theoretic representation of the integral classification criterion are proposed. The criterion allows algorithmization of choice of view’s points on the studied subject area. Classification criteria determine composition of significant features of the object. For its unified representation, a single format in the form of a raster graphic image - a graphic-chromatic map, as well as an algorithm for creation of such maps have been developed. Conclusions. The proposed approach significantly simplifies the procedure of distributing heterogeneous objects into classes by reducing it to a well-studied and proven task of machine learning - classification of raster graphic images. Formalization of the integral classification criterion allows quickly and conveniently set different points of view on the subject area, representing them in the form of data tuples. The algorithm for forming grapho-chromatic maps provides obtaining raster images of a single size and format for feeding to the input of a pretrained neural network classifier. At the same time, when changing the point of view on the subject area (i.e. when changing the number and composition of significant features of objects), there is no need to change the structure of the classifier or retrain it.
作者简介
Aleksandr Bozhday
Penza State University
编辑信件的主要联系方式.
Email: bozhday@yandex.ru
Doctor of engineering sciences, professor, professor of the sub-department of computer aided design systems
(40 Krasnaya street, Penza, Russia)Lev Gorshenin
Penza State University
Email: gorshenin.lev@gmail.com
Postgraduate student
(40 Krasnaya street, Penza, Russia)参考
- Berisha B., Mëziu E. Big Data Analytics in Cloud Computing: An overview. Journal of cloud computing. 2022;11(24). doi: 10.1186/s13677-022-00301-w
- Lim J. What is data classification? Alation. Available at: https://www.alation.com/blog/what-is-data-classification/ (accessed 12.04.2024).
- Yang Y., Wu Y.F., Zhan D.Ch., Liu Zh.B., Jiang Y. Complex object clas-sification: A multi-modal multi-instance multi-label deep network with optimal transport. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2018:2594‒2603.
- Krasnyanskiy M.N., Obukhov A.D., Voyakina A.A., Solomatina E.M. A comparative analysis of machine learning methods for solving the problem of document classification in a scientific and educational institution. Vestnik Voronezhskogo gosudarstvennogo universiteta. Seriya: Sistemnyy analiz i informatsionnye tekhnologii = Bulletin of Voronezh State University. Series: Systems analysis and information technology. 2018;(3):173‒182. (In Russ.). doi: 10.17308/sait.2018.3/1245
- Korshunov A., Beloborodov I., Gomzin A., Chuprina K., Astrakhantsev N., Nedumov Ya., Turdakov D. Determining demographic attributes of microblogging users. Trudy Instituta sistemnogo programmirovaniya RAN = Proceedings of the Institute for System Programming of the Russian Academy of Sciences. 2013:179‒194. (In Russ.)
- Kulikova A.A. An approach to classifying users in social networks. Vostochnoevropeyskiy zhurnal peredovykh tekhnologiy = East European Journal of Advanced Technologies. 2011;(2):14‒18. (In Russ.)
- Stankevich M.A., Ignat'ev N.A., Smirnov I.V., Kisel'nikova N.V. Identifying personality traits in VKontakte users. Voprosy kiberbezopasnosti = Cybersecurity issues. 2019;(4):80‒87. (In Russ.). doi: 10.21681/2311-3456-2019-4-80-87
- Zhiryaeva E.V., Naumov V.N. Method of text analysis in tariff classification of goods in customs matters. Programmnye produkty i sistemy = Software products and systems. 2020;(3):538‒548. (In Russ.)
- Setlak G. Using artificial neural networks to solve classification problems in management. Radіoelektronіka, іnformatika, upravlіnnya = Radio electronics, computer science, management. 2004;(1):127‒135. (In Russ.)
- Nima M., Histon J., Dizaji R., Waslander S.L. Feature extraction and radar track classification for detecting UAVs in civillian airspace. 2014 IEEE Radar Conference. 2014:0674‒0679. doi: 10.1109/RADAR.2014.6875676
- Sentsov A.A., Polyakov V.B., Ivanov S.A., Pomozova T.G. Method of intercepting small-sized and stealthy unmanned aerial vehicles. Trudy MAI = Proceedings of MAI. 2023;(21). (In Russ.)
- Bozhday A.S., Gorshenin L.N. A method for classifying objects with a heterogeneous set of information features based on the generation of graphochromatic maps. Izvestiya vysshikh uchebnykh zavedeniy. Povolzhskiy region. Tekhnicheskie nauki = University proceedings. Volga region. Engineering sciences. 2024;(3):5‒13. (In Russ.). doi: 10.21685/2072-3059-2024-3-1 EDN: BGBNIC
- Stepp R., Michalsky R.S. Conceptual clustering of structured objects: a goal-oriented approach. AI Journal. 1986;103‒110.
- Selić B., Pierantonio A. Fixing Classification: A Viewpoint-Based Approach. Leveraging Applications of Formal Methods, Verification and Validation. ISoLA 2021. Lecture Notes in Computer Science. 2021:346‒356. (In Russ.). doi: 10.1007/978-3-030-89159- 6_22
- Tang J., Alelyani S., Liu H. Feature Selection for Classification: A Review. Data Classification: Algorithms and Applications. 2014. doi: 10.1201/b17320-3
- Dash M., Liu H. Feature Selection for Classification. Intelligent Data Analysis. 1997:131‒156.
- Rawat W., Wang Z. Deep convolutional neural networks for image classification: A comprehensive review. Neural Computation. 2017;(29):2352‒2449.
- Tuples and the cartesian product of sets. Regional'nyy finansovo-ekonomicheskiy institute = Regional Financial and Economic Institute. (In Russ.). Available at: https://it.rfei.ru/course/~mBme/~6v6/~0pCLCk (accessed 20.05.2025).
- Guid. Microsoft. Available at: https://learn.microsoft.com/ruru/ dotnet/api/system.guid?view=net-8.0 (accessed 10.05.2025).
- Bozhday A.S., Gorshenin L.N. Development of an algorithm for generating graphicchromatic maps for solving classification problems. Vestnik Penzenskogo gosudarstvennogo universiteta = Bulletin of Penza State University. 2024;(4):22‒26. (In Russ.). EDN: LOCGJQ
补充文件

