<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE root>
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:ali="http://www.niso.org/schemas/ali/1.0/" article-type="research-article" dtd-version="1.2" xml:lang="en"><front><journal-meta><journal-id journal-id-type="publisher-id">Computational nanotechnology</journal-id><journal-title-group><journal-title xml:lang="en">Computational nanotechnology</journal-title><trans-title-group xml:lang="ru"><trans-title>Computational nanotechnology</trans-title></trans-title-group></journal-title-group><issn publication-format="print">2313-223X</issn><issn publication-format="electronic">2587-9693</issn><publisher><publisher-name xml:lang="en">YUR-VAK</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="publisher-id">358392</article-id><article-id pub-id-type="doi">10.33693/2313-223X-2025-12-5-143-153</article-id><article-id pub-id-type="edn">EZTMLX</article-id><article-categories><subj-group subj-group-type="toc-heading" xml:lang="en"><subject>INFORMATICS AND INFORMATION PROCESSING</subject></subj-group><subj-group subj-group-type="toc-heading" xml:lang="ru"><subject>ИНФОРМАТИКА И ИНФОРМАЦИОННЫЕ ПРОЦЕССЫ</subject></subj-group><subj-group subj-group-type="article-type"><subject>Research Article</subject></subj-group></article-categories><title-group><article-title xml:lang="en">A Model for the intelligent analysis and detection of anomalies in the data of statistical observation of educational organizations</article-title><trans-title-group xml:lang="ru"><trans-title>Модель интеллектуального анализа и обнаружения аномалий в данных статистического наблюдения за образовательными организациями</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author"><contrib-id contrib-id-type="spin">1383-7078</contrib-id><name-alternatives><name xml:lang="en"><surname>Vinogradov</surname><given-names>Nikita E.</given-names></name><name xml:lang="ru"><surname>Виноградов</surname><given-names>Никита Евгеньевич</given-names></name></name-alternatives><address><country country="RU">Russian Federation</country></address><bio xml:lang="en"><p>graduate student, Institute for Advanced Technologies and Industrial Programming</p></bio><bio xml:lang="ru"><p>аспирант, Институт перспективных технологий и индустриального программирования</p></bio><email>vinogradov_n@mirea.ru</email><xref ref-type="aff" rid="aff1"/></contrib><contrib contrib-type="author"><contrib-id contrib-id-type="orcid">https://orcid.org/0000-0003-1690-7961</contrib-id><contrib-id contrib-id-type="scopus">57205359470</contrib-id><contrib-id contrib-id-type="researcherid">B-5750-2017</contrib-id><contrib-id contrib-id-type="spin">7619-6288</contrib-id><name-alternatives><name xml:lang="en"><surname>Vostroknutov</surname><given-names>Igor E.</given-names></name><name xml:lang="ru"><surname>Вострокнутов</surname><given-names>Игорь Евгеньевич</given-names></name></name-alternatives><address><country country="RU">Russian Federation</country></address><bio xml:lang="en"><p>Dr. Sci. (Pedag.), Professor, Institute for Advanced Technologies and Industrial Programming</p></bio><bio xml:lang="ru"><p>доктор педагогических наук, профессор, Институт перспективных технологий и индустриального программирования</p></bio><email>vostroknutov_i@mail.ru</email><xref ref-type="aff" rid="aff1"/></contrib></contrib-group><aff-alternatives id="aff1"><aff><institution xml:lang="en">MIREA – Russian Technological University</institution></aff><aff><institution xml:lang="ru">МИРЭА – Российский технологический университет</institution></aff></aff-alternatives><pub-date date-type="pub" iso-8601-date="2025-12-14" publication-format="electronic"><day>14</day><month>12</month><year>2025</year></pub-date><volume>12</volume><issue>5</issue><fpage>143</fpage><lpage>153</lpage><history><date date-type="received" iso-8601-date="2025-12-16"><day>16</day><month>12</month><year>2025</year></date></history><permissions><copyright-statement xml:lang="en">Copyright ©; 2025, Yur-VAK</copyright-statement><copyright-statement xml:lang="ru">Copyright ©; 2025, Юр-ВАК</copyright-statement><copyright-year>2025</copyright-year><copyright-holder xml:lang="en">Yur-VAK</copyright-holder><copyright-holder xml:lang="ru">Юр-ВАК</copyright-holder><license><ali:license_ref xmlns:ali="http://www.niso.org/schemas/ali/1.0/">https://www.urvak.ru/contacts/</ali:license_ref></license></permissions><self-uri xlink:href="https://journals.rcsi.science/2313-223X/article/view/358392">https://journals.rcsi.science/2313-223X/article/view/358392</self-uri><abstract xml:lang="en"><p>This article describes an algorithm for applying an intelligent analysis model to detect anomalies in statistical observation data for educational organizations. The definition of an anomaly is given, typical anomalies that may be contained in statistical reporting data are analyzed. The classification of anomaly detection techniques is given depending on the level of markup of the training sample, and possible ways of marking up data to present the results of the anomaly search are analyzed. The analysis and description of the process of collecting and processing statistical data of educational organizations in the Scientific and Technical Center of RTU MIREA is carried out. The weaknesses of the data collection process are analyzed, which can be strengthened by applying intelligent analysis to search for anomalies in the data. The analysis and mathematical description of the format and features of the received and stored statistical data is carried out. An algorithm has been developed for preparing data for training an intelligent analysis model, taking into account their specifics, as well as the subsequent application of the trained model to detect anomalies in the data under consideration. The algorithm was tested on real data using the autoencoder neural network model.</p></abstract><trans-abstract xml:lang="ru"><p>В<bold> </bold>статье описан алгоритм применения модели интеллектуального анализа для обнаружения аномалий в данных статистического наблюдения за образовательными организациями. Дано определение аномалии, проанализированы типовые аномалии, которые могут содержаться в данных статистической отчетности. Приведена классификация методик выявления аномалий в зависимости от уровня размеченности обучающей выборки, а также проанализированы возможные способы разметки данных для представления результатов поиска аномалий. Проведены анализ и описание процесса сбора и обработки статистических данных образовательных организаций в ГИВЦ РТУ МИРЭА. Проанализированы слабые места процесса сбора данных, которые возможно усилить путем применения интеллектуального анализа для поиска аномалий в данных. Разработана математическая модель обработки данных и поиска аномалий. Предложен алгоритм подготовки данных для обучения модели интеллектуального анализа с учетом их специфики, а также последующего применения обученной модели для обнаружения аномалий в рассматриваемых данных. Произведена проверка работы алгоритма на реальных данных с использованием нейросетевой модели автоэнкодер.</p></trans-abstract><kwd-group xml:lang="en"><kwd>anomaly detection</kwd><kwd>statistical data</kwd><kwd>data mining</kwd><kwd>autoencoder</kwd></kwd-group><kwd-group xml:lang="ru"><kwd>обнаружение аномалий в данных</kwd><kwd>статистические данные</kwd><kwd>интеллектуальный анализ</kwd><kwd>автоэнкодер</kwd></kwd-group><funding-group/></article-meta></front><body></body><back><ref-list><ref id="B1"><label>1.</label><citation-alternatives><mixed-citation xml:lang="en">Bardasova I.A., Volkova E.A. Anomaly detection in emails using machine learning. Vestnik Nauki. 2024. Vol. 4. No. 5 (74). Pp. 1350–1358. (In Rus.)</mixed-citation><mixed-citation xml:lang="ru">Бардасова И.А., Волкова Е.А. Обнаружение аномалий в электронных письмах с помощью машинного обучения // Вестник науки. 2024. Т. 4. № 5 (74). С. 1350–1358.</mixed-citation></citation-alternatives></ref><ref id="B2"><label>2.</label><citation-alternatives><mixed-citation xml:lang="en">Grusho A.A. et al. Anomaly detection in big data. Sistemy i Sredstva Informatiki. 2022. Vol. 32. No. 1. Pp. 160–167. (In Rus.)</mixed-citation><mixed-citation xml:lang="ru">Грушо А.А. и др. Поиск аномалий в больших данных // Системы и средства информатики. 2022. Т. 32. № 1. С. 160–167.</mixed-citation></citation-alternatives></ref><ref id="B3"><label>3.</label><citation-alternatives><mixed-citation xml:lang="en">Mikhnyaev A.L. et al. Anomaly detection in technological process data. In: Proceedings of the Conference “Digital Environment: Technologies and Prospects” (DETP 2022). N.N. Shalobyta (ed.) et al. Brest: Belarus: Brest State Technical University, 2022. Pp. 62–65.</mixed-citation><mixed-citation xml:lang="ru">Михняев А.Л. и др. Обнаружение аномалий в данных технологического процесса // Цифровая среда: технологии и перспективы: матер. конф. “Digital Environment: Technologies and Prospects” DETP 2022 (Брест, 31 октября 2022 г.). / ред. Н.Н. Шалобыта и др. Брест: БрГТУ, 2022. С. 62–65.</mixed-citation></citation-alternatives></ref><ref id="B4"><label>4.</label><citation-alternatives><mixed-citation xml:lang="en">Safronov D.A., Katser Yu.D., Zaitsev K.S. Anomaly detection using autoencoders. International Journal of Open Information Technologies. 2022. Vol. 10. No. 8. Pp. 39–45. (In Rus.)</mixed-citation><mixed-citation xml:lang="ru">Сафронов Д.А., Кацер Ю.Д., Зайцев К.С. Поиск аномалий с помощью автоэнкодеров // International Journal of Open Information Technologies. 2022. Т. 10. № 8. С. 39–45.</mixed-citation></citation-alternatives></ref><ref id="B5"><label>5.</label><citation-alternatives><mixed-citation xml:lang="en">Tokarev V.V. A review of machine learning methods used for outlier removal in datasets. In: Scientific review: Current issues of theory and practice. Proceedings of the V International Scientific and Practical Conference. Penza: Nauka i Prosveshchenie, 2023. Pp. 18–24.</mixed-citation><mixed-citation xml:lang="ru">Токарев В.В. Обзор методов машинного обучения, используемых для удаления выбросов в наборе данных // Научное обозрение: актуальные вопросы теории и практики: сборник статей V Международной научно-практической конференции. Пенза: Наука и Просвещение, 2023. С. 18–24.</mixed-citation></citation-alternatives></ref><ref id="B6"><label>6.</label><citation-alternatives><mixed-citation xml:lang="en">Shkodyrev V.P. et al. A survey of anomaly detection methods in data streams. In: Proceedings of the Second Conference on Software Engineering and Information Management. Vol. 1864. St. Petersburg, 2017.</mixed-citation><mixed-citation xml:lang="ru">Шкодырев В.П. и др. Обзор методов обнаружения аномалий в потоках данных // Proc. of the Second Conference on Software Engineering and Information Management. Т. 1864. СПб., 2017.</mixed-citation></citation-alternatives></ref><ref id="B7"><label>7.</label><citation-alternatives><mixed-citation xml:lang="en">Shelukhin O.I., Ryabinin V.S. Anomaly detection in big data from unstructured system logs. Voprosy Kiberbezopasnosti. 2019. Vol. 2. No. 30. Pp. 36–41. (In Rus.)</mixed-citation><mixed-citation xml:lang="ru">Шелухин О.И., Рябинин В.С. Обнаружение аномалий больших данных неструктурированных системных журналов // Вопросы кибербезопасности. 2019. № 2 (30). С. 36–41.</mixed-citation></citation-alternatives></ref><ref id="B8"><label>8.</label><citation-alternatives><mixed-citation xml:lang="en">Zimek A., Schubert E. Outlier detection. In: Encyclopedia of database systems. New York: Springer, 2017.</mixed-citation><mixed-citation xml:lang="ru">Zimek A., Schubert E. Outlier detection // Encyclopedia of database systems. New York: Springer, 2017.</mixed-citation></citation-alternatives></ref><ref id="B9"><label>9.</label><citation-alternatives><mixed-citation xml:lang="en">Braei M., Wagner S. Anomaly detection in univariate time-series: A survey on the state-of-the-art. arXiv preprint arXiv:2004.00433. 2020.</mixed-citation><mixed-citation xml:lang="ru">Braei M., Wagner S. Anomaly detection in univariate time-series: A survey on the state-of-the-art // arXiv preprint arXiv:2004.00433. 2020.</mixed-citation></citation-alternatives></ref><ref id="B10"><label>10.</label><citation-alternatives><mixed-citation xml:lang="en">Chandola V., Banerjee A., Kumar V. Anomaly detection: A survey. ACM computing surveys (CSUR). 2009. Vol. 41. No. 3. Pp. 1–58.</mixed-citation><mixed-citation xml:lang="ru">Chandola V., Banerjee A., Kumar V. Anomaly detection: A survey // ACM computing surveys (CSUR). 2009. Vol. 41. No. 3. Pp. 1–58.</mixed-citation></citation-alternatives></ref><ref id="B11"><label>11.</label><mixed-citation>Salgado C.M., Azevedo C., Proenca H., Vieira S.M. Noise versus outliers. Cham: Springer International Publishing, 2016. Pp. 163–183.</mixed-citation></ref><ref id="B12"><label>12.</label><mixed-citation>Charu C. Aggarwal. outlier analysis. 2nd ed. Springer Publishing Company, Inc., 2016. ISBN: 3319475770.</mixed-citation></ref><ref id="B13"><label>13.</label><citation-alternatives><mixed-citation xml:lang="en">Hojjati H., Ho T.K.K., Armanfard N. Self-supervised anomaly detection: A survey and outlook. arXiv preprint arXiv:2205.05173. 2022.</mixed-citation><mixed-citation xml:lang="ru">Hojjati H., Ho T.K.K., Armanfard N. Self-supervised anomaly detection: A survey and outlook // arXiv preprint arXiv:2205.05173. 2022.</mixed-citation></citation-alternatives></ref><ref id="B14"><label>14.</label><citation-alternatives><mixed-citation xml:lang="en">Ruff L. et al. A unifying review of deep and shallow anomaly detection. Proceedings of the IEEE. 2021. Vol. 109. No. 5. Pp. 756–795.</mixed-citation><mixed-citation xml:lang="ru">Ruff L. et al. A unifying review of deep and shallow anomaly detection // Proceedings of the IEEE. 2021. Vol. 109. No. 5. Pp. 756–795.</mixed-citation></citation-alternatives></ref><ref id="B15"><label>15.</label><citation-alternatives><mixed-citation xml:lang="en">Smiti A. A critical overview of outlier detection methods. Computer Science Review. 2020. Vol. 38. P. 100306.</mixed-citation><mixed-citation xml:lang="ru">Smiti A. A critical overview of outlier detection methods // Computer Science Review. 2020. Vol. 38. P. 100306.</mixed-citation></citation-alternatives></ref></ref-list></back></article>
