Educational data mining for predicting the academic performance of university students
- Authors: Popova N.A.1, Egorova E.S.2
-
Affiliations:
- Penza State University
- Penza State Technological University
- Issue: No 2 (2023)
- Pages: 18-29
- Section: Informatics and information processes
- Submitted: 18.11.2025
- Published: 04.02.2026
- URL: https://journals.rcsi.science/1991-6639/article/view/351950
- DOI: https://doi.org/10.35330/1991-6639-2023-2-112-18-29
- EDN: https://elibrary.ru/GXEHAC
- ID: 351950
Cite item
Full Text
Abstract
Progress in the field of data mining makes it possible to use educational data to improve the quality of educational processes. This article examines various methods of analyzing student achievement data. The focus is on two aspects: first, predicting students' academic achievements at the end of a four-year undergraduate curriculum; second, examining typical student progressions and combining them with the prediction results. Approximately 10 classification algorithms were used in the prediction process. An approach to improving the performance of classification methods is proposed where classifier attributes are selected during their training. Two important groups of students were identified: low-achieving and high-achieving students. The results show that by focusing on a small number of courses that are indicators of particularly good or poor performance, it is possible to prevent and support low-achieving students in a timely manner, and to provide advice and opportunities to high-achieving students.
About the authors
Nataliya A. Popova
Penza State University
Email: popov.tasha@yandex.ru
ORCID iD: 0000-0001-9713-4897
Candidate of Technical Sciences, Associate Professor of the Department of Mathematical Support and Computer Use
Russian Federation, 440026, Russia, Penza, 40 Krasnaya streetEkaterina S. Egorova
Penza State Technological University
Author for correspondence.
Email: katepost@yandex.ru
ORCID iD: 0000-0002-0816-0944
Candidate of Economic Sciences, Associate Professor of the Department of Applied Informatics
Russian Federation, 440039, Russia, Penza, 1a/11 Baidukova passage/Gagarina streetReferences
- Belonozhko P.P., Karpenko A.P., Khramov D.A. Analysis of educational data: directions and prospects for application. Internet-zhurnal "Naukovedeniye" [SCIENCE online journal]. Vol. 9. No. 4 (2017). URL: http://naukovedenie.ru/PDF/15TVN417.pdf (In Russian)
- Mamontova M.Yu. Quality of educational achievements: assessment and forecast based on the results of criteria-oriented testing. Education and science. News of the Ural Branch of the Russian Academy of Education. 2009. No. 3(60). Pp. 18-26. (In Russian)
- Rusakov S.V., Nakaryakova N.N. Forecasting the progress of first-year students using a decision tree based on their results of passing the exam. Nauka. Informatizatsiya. Tekhnologii. Obrazovaniye [Science. Informatization. Technologies. Education]: Materialy XI mezhdunarodnoy nauchno-prakticheskoy konferentsii. Yekaterinburg: Rossiyskiy gosudarstvennyy professional'no-pedagogicheskiy universitet, 2018. Pp. 589-594. (In Russian)
- Firstov V. E. Sociometric and informational aspects of clustering the student contingent in the organization and optimization of group cooperation in the educational process at school and university. Izvestiya of Saratov University. New series. Series: Philosophy. Psychology. Pedagogy. 2014. Vol. 14. No. 1. Pp. 110-118. (In Russian)
- Medvedev D., D'yakonov A. New Properties of the Data Distillation Method When Working with Tabular Data. Conference proceedings "Analysis of Images, Social Networks and Texts". Lecture Notes in Computer Science. Vol. 12602. Springer, Cham. 2021. https://doi.org/ 10.1007/978-3-030-72610-2_29
- Sucholutsky I., Schonlau M. Soft-Label Dataset Distillation and Text Dataset Distillation. International Joint Conference on Neural Networks, Shenzhen, China, 2021. Pp. 1-8. doi: 10.1109/IJCNN52387.2021.9533769.
- Nikonorova M.L. Computer model for solving classification problems in the Rapid Miner software environment. Medical education and professional development. 2017. No. 2-3(28-29). Pp. 24-33. (In Russian)
- Filyak P.Yu., Vinogradov M.A. Application of Rapid miner and open environments as data mining tools for security. Informatsiya i bezopasnost' [Information and security]. 2017. Vol. 20. No. 4. Pp. 552-555. (In Russian)
- Maimon O., Rokach L. Data Mining and Knowledge Discovery Handbook. Springer Science, Business Media, 2010. 1285 p. ISBN: 978-0-387-09822-7.
Supplementary files



