Intelligent System of Analytical Processing of Digital Network Content for Protection Against Inappropriate Information

I. V Kotenko; Котенко И. В; I. B Saenko; Саенко И. Б; A. A Branitskiy; Браницкий А. А; I. B Parashchuk; Паращук И. Б; D. A Gaifulina; Гайфулина Д. А

doi:10.15622/ia.20.4.1

Authors: Kotenko I.V¹, Saenko I.B¹, Branitskiy A.A¹, Parashchuk I.B¹, Gaifulina D.A¹
Affiliations:
1. SPC RAS
Issue: Vol 20, No 4 (2021)
Pages: 755-792
Section: Information security
URL: https://journals.rcsi.science/2713-3192/article/view/266322
DOI: https://doi.org/10.15622/ia.20.4.1
ID: 266322

Abstract

The Internet and social networks, as media for distributing digital network content, are becoming one of the most significant sources of threats to personal, public, and state information security. Individuals, society, and the state therefore need protection from inappropriate information. From a scientific and methodological standpoint, very few solutions to this problem exist. This motivates the results presented in the article, which are aimed at developing an intelligent system for the analytical processing of digital network content to protect against inappropriate information. The article discusses the conceptual foundations of such a system, clarifies what is meant by inappropriate information, and presents the overall system architecture. It gives models and algorithms for the most characteristic components of the system: a distributed network scanning component, a component for multidimensional classification of network information objects, a component for eliminating incompleteness and inconsistency, and a decision-making component. The article also presents the implementation and experimental evaluation of the system components, which demonstrated that the system can meet the requirements for completeness and accuracy in detecting and counteracting inappropriate information when that information is incomplete and inconsistent.

Keywords

intelligent system, digital network content, inappropriate information, classification, fuzzy knowledge, decision making
