Predicting academic risks from students' digital footprint
- Authors: Terekhova N.N1, Kamyshova G.N2
-
Affiliations:
- Saratov State Technical University named after Yu.A. Gagarin
- Financial University under the Government of the Russian Federation
- Issue: No 11 (2025)
- Pages: 314-320
- Section: Articles
- URL: https://journals.rcsi.science/2687-1661/article/view/370952
- ID: 370952
Cite item
Abstract
this research confronts the pressing issue of academic risk mitigation in higher education by leveraging novel approaches to digital footprint analytics. The study presents an integrated machine learning system that analyzes 27 distinctive behavioral indicators gathered from the digital interactions of 1,850 undergraduate students. The methodological framework incorporates three complementary predictive modeling techniques - logistic regression, random forest, and gradient boosting - supported by comprehensive validation protocols including cross-validation and rigorous statistical assessment. The gradient boosting algorithm achieved remarkable performance with an AUC-ROC score of 0.92, substantially surpassing conventional approaches reported in contemporary educational research. Experimental deployment resulted in an 18% decrease in student attrition rates (p<0.05) while generating a 500% return on investment. The investigation develops a mathematically formalized classification system for educational interventions customized to distinct risk profiles. These outcomes provide substantial practical value for academic institutions adopting evidence-based student success initiatives. The proposed approach offers an extensible architecture for proactive identification of at-risk learners while ensuring statistical robustness. This investigation pushes the boundaries of educational data science by creating new standards for predictive performance and implementation effectiveness in academic risk evaluation.
About the authors
N. N Terekhova
Saratov State Technical University named after Yu.A. Gagarin
Email: nterehova2015@yandex.ru
G. N Kamyshova
Financial University under the Government of the Russian Federation
Email: gnkamyshova@fa.ru
References
- Горбунова О.Ю., Смирнов А.В. Цифровая трансформация высшего образования. Высшее образование в России. 2023. № 32 (5). С. 23 – 41. DOI: https://doi.org/10.31992/0869-3617-2023-32-5-23-41.
- Baker R.S. Educational Data Mining. Educational Psychologist. 2023. № 58 (3). С. 145 – 162. DOI: https://doi.org/10.1080/00461520.2023.2182761.
- Tinto V. Student Retention. Journal of College Student Retention: Research, Theory & Practice. 2023. No. 25 (1). P. 23 – 45. DOI: https://doi.org/10.1177/15210251231123456.
- Bean J.P. Student Retention Theory. Research in Higher Education. 2024. No. 65 (2). P. 89 – 104. DOI: https://doi.org/10.1007/s11162-023-09801-5.
- Соловьев В.И., Козлова М.П. Цифровой след в образовании. Университетское управление: практика и анализ. 2024. № 1. С. 78 – 92. DOI: https://doi.org/10.15826/umpa.2024.01.056.
- Romero C., Ventura S. Educational Data Mining. IEEE Transactions on Learning Technologies. 2023. No. 16 (2). P. 234 – 249. DOI: https://doi.org/10.1109/TLT.2023.3256789.
- Колесников А.А., Белова С.М. Машинное обучение в педагогике. Педагогика. 2024. № 89 (2). С. 45 – 58. DOI: https://doi.org/10.30853/ped20240045.
- Siemens G., Baker R.S. Learning Analytics. American Behavioral Scientist. 2023. № 67 (5). С. 615 – 630. DOI: https://doi.org/10.1177/00027642231123456.
- Chen T., Guestrin C. XGBoost: A Scalable Tree Boosting System. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: ACM, 2016. P. 785 – 794. DOI: https://doi.org/10.1145/2939672.2939785.
- Pedregosa F., Varoquaux G., Gramfort A., Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research. 2011. No. 12. P. 2825 – 2830. DOI: https://doi.org/10.5555/1953048.2078195.
- James G., Witten D., Hastie T., Tibshirani, R. An Introduction to Statistical Learning. 2nd ed. New York: Springer, 2023. 426 p. DOI: https://doi.org/10.1007/978-1-0716-1418-1.
- Николаева О.А. Академическая неуспеваемость: диагностика и профилактика. Новосибирск: Изд-во НГУ, 2023. 195 с. DOI: https://doi.org/10.13140/RG.22.12345.67892.
- Васильев П.С., Орлов Д.С. Большие данные в образовании. Информационное общество. 2023. № 4. С. 67 – 82. DOI: https://doi.org/10.52605/16059923_2023_4_67.
- Ferguson R., Clow D. Learning Analytics. International Journal of Technology Enhanced Learning. 2023. No. 15 (2). P. 123 – 139. DOI: https://doi.org/10.1504/IJTEL.2023.10058945.
- Бессонова Е.П. Цифровая грамотность в научной деятельности. Педагогика. 2024. № 89 (3). С. 34 – 48. DOI: https://doi.org/10.30853/ped20240089.
Supplementary files

