<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE root>
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:ali="http://www.niso.org/schemas/ali/1.0/" article-type="research-article" dtd-version="1.2" xml:lang="en"><front><journal-meta><journal-id journal-id-type="publisher-id">Computational nanotechnology</journal-id><journal-title-group><journal-title xml:lang="en">Computational nanotechnology</journal-title><trans-title-group xml:lang="ru"><trans-title>Computational nanotechnology</trans-title></trans-title-group></journal-title-group><issn publication-format="print">2313-223X</issn><issn publication-format="electronic">2587-9693</issn><publisher><publisher-name xml:lang="en">YUR-VAK</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="publisher-id">309697</article-id><article-id pub-id-type="doi">10.33693/2313-223X-2025-12-1-26-33</article-id><article-id pub-id-type="edn">LNPTVP</article-id><article-categories><subj-group subj-group-type="toc-heading" xml:lang="en"><subject>INFORMATION TECHNOLOGY AND TELECOMMUNICATION</subject></subj-group><subj-group subj-group-type="toc-heading" xml:lang="ru"><subject>ИНФОРМАЦИОННЫЕ ТЕХНОЛОГИИ И ТЕЛЕКОММУНИКАЦИИ</subject></subj-group><subj-group subj-group-type="article-type"><subject>Research Article</subject></subj-group></article-categories><title-group><article-title xml:lang="en">Comparative Analysis of HDFS and Apache Ozone Data Storage Systems</article-title><trans-title-group xml:lang="ru"><trans-title>Сравнительный анализ систем хранения данных HDFS и Apache Ozone</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author"><contrib-id contrib-id-type="orcid">https://orcid.org/0009-0003-2723-3154</contrib-id><contrib-id contrib-id-type="researcherid">IAN-1730-2023</contrib-id><contrib-id contrib-id-type="spin">1380-5720</contrib-id><name-alternatives><name xml:lang="en"><surname>Ievlev</surname><given-names>Kirill O.</given-names></name><name xml:lang="ru"><surname>Иевлев</surname><given-names>Кирилл Олегович</given-names></name></name-alternatives><address><country country="RU">Russian Federation</country></address><bio xml:lang="en"><p>Postgraduate Student, Assistant of the Department of Mathematical Cybernetics and Information Technologies</p></bio><bio xml:lang="ru"><p>аспирант, ассистент кафедры математической кибернетики и информационных технологий</p></bio><email>ievlev.k.o@yandex.ru</email><xref ref-type="aff" rid="aff1"/></contrib><contrib contrib-type="author"><contrib-id contrib-id-type="orcid">https://orcid.org/0000-0003-1739-9831</contrib-id><contrib-id contrib-id-type="scopus">55836031600</contrib-id><contrib-id contrib-id-type="researcherid">D-3256-2019</contrib-id><contrib-id contrib-id-type="spin">4576-9642</contrib-id><name-alternatives><name xml:lang="en"><surname>Gorodnichev</surname><given-names>Mikhail G.</given-names></name><name xml:lang="ru"><surname>Городничев</surname><given-names>Михаил Геннадьевич</given-names></name></name-alternatives><address><country country="RU">Russian Federation</country></address><bio xml:lang="en"><p>Cand. Sci. (Eng.), Associate Professor, Head of the Department of Mathematical Cybernetics and Information Technologies, Dean of the Faculty of Information Technologies</p></bio><bio xml:lang="ru"><p>кандидат технических наук, доцент, заведующий кафедры математической кибернетики и информационных технологий, декан факультета информационных технологий</p></bio><email>m.g.gorodnichev@mtuci.ru</email><xref ref-type="aff" rid="aff1"/></contrib></contrib-group><aff-alternatives id="aff1"><aff><institution xml:lang="en">Moscow Technical University of Communications and Informatics</institution></aff><aff><institution xml:lang="ru">Московский технический университет связи и информатики</institution></aff></aff-alternatives><pub-date date-type="pub" iso-8601-date="2025-06-19" publication-format="electronic"><day>19</day><month>06</month><year>2025</year></pub-date><volume>12</volume><issue>1</issue><fpage>26</fpage><lpage>33</lpage><history><date date-type="received" iso-8601-date="2025-09-18"><day>18</day><month>09</month><year>2025</year></date></history><permissions><copyright-statement xml:lang="en">Copyright ©; 2025, Yur-VAK</copyright-statement><copyright-statement xml:lang="ru">Copyright ©; 2025, Юр-ВАК</copyright-statement><copyright-year>2025</copyright-year><copyright-holder xml:lang="en">Yur-VAK</copyright-holder><copyright-holder xml:lang="ru">Юр-ВАК</copyright-holder><license><ali:license_ref xmlns:ali="http://www.niso.org/schemas/ali/1.0/">https://www.urvak.ru/contacts/</ali:license_ref></license></permissions><self-uri xlink:href="https://journals.rcsi.science/2313-223X/article/view/309697">https://journals.rcsi.science/2313-223X/article/view/309697</self-uri><abstract xml:lang="en"><p>Over the last few decades, both the volume of digital data in the globe and the variety of ways to use it have increased dramatically. For a long time, the Hadoop ecosystem, which is still widely utilized, has been synonymous with large data storage and processing platforms. However, during the past 20 years, Hadoop has been found to have a number of serious flaws, including the “small files problem” and uneven cluster resource usage. Various commercial and research organizations are faced with the issue of upgrading the data stack to improve resource utilization and increasing data processing efficiency. <italic>This study aims</italic> to examine the benefits and drawbacks of the next-generation data storage system, Apache Ozone, and to assess whether this technology is ready to completely supplant the Hadoop Distributed File System (HDFS).</p></abstract><trans-abstract xml:lang="ru"><p>За последние десятилетия значительно выросло не только количество цифровых данных в мире, но и способов их использования. Пионером и долгое время синонимом платформы для хранения и обработки больших данных являлась экосистема Hadoop, которая и по сей день активно используется во множестве крупнейших компаний. Однако, за почти 20 лет, прошедших с первого релиза Hadoop, был выявлен ряд существенных недостатков, такие как «проблема маленьких файлов» и неравномерное использование ресурсов кластеров. Во многих коммерческий и исследовательских организациях встает вопрос о модернизации стека работы с данными для повышения утилизации ресурсов и расширения возможностей для эффективной работы с данными. <italic>Цель данной работы</italic> – продемонстрировать достоинства и недостатки хранилища данных нового поколения – Apache Ozone и сделать вывод о готовности технологии для полноценной замены распределенной файловой системы Hadoop (HDFS).</p></trans-abstract><kwd-group xml:lang="en"><kwd>big data storage</kwd><kwd>distributed file systems</kwd><kwd>object storage</kwd><kwd>S3</kwd><kwd>Apache Hadoop</kwd><kwd>Apache Ozone</kwd></kwd-group><kwd-group xml:lang="ru"><kwd>хранение больших данных</kwd><kwd>распределенные файловые системы</kwd><kwd>объектное хранилище</kwd><kwd>S3</kwd><kwd>Apache Hadoop</kwd><kwd>Apache Ozone</kwd></kwd-group><funding-group/></article-meta></front><body></body><back><ref-list><ref id="B1"><label>1.</label><mixed-citation>Aggarwal R., Verma J., Siwach M. Small files’ problem in Hadoop: A systematic literature review. Journal of King Saud University “Computer and Information Sciences”. 2022. No. 34 (10). Part A. Pp. 8658–8674. DOI: 10.1016/j.jksuci.2021.09.007.</mixed-citation></ref><ref id="B2"><label>2.</label><mixed-citation>Harby A.A., Zulkernine F. From data warehouse to lakehouse: A comparative review. In: IEEE International Conference on Big Data (Big Data). Osaka, 2022. Pp. 389–395. DOI: 10.1109/BigData55660.2022.10020719.</mixed-citation></ref><ref id="B3"><label>3.</label><mixed-citation>Jain E.P., Gupta E.A. Hadoop architecture and its issues. International Journal of Engineering Research and General Science. 2017. No. 5 (2). Pp. 211–217. DOI: 10.1109/CSCI.2014.140.</mixed-citation></ref><ref id="B4"><label>4.</label><mixed-citation>Niazi S., Ismail M., Haridi S. et al. HopsFS: Scaling Hierarchical File System Metadata Using NewSQL Databases. In: 15th USENIX Conference on File and Storage Technologies (FAST 17). USENIX Association, 2017. Pp. 89–104. DOI: 10.48550/arXiv.1606.01588.</mixed-citation></ref><ref id="B5"><label>5.</label><mixed-citation>Sharma G., Tripathi V., Srivastava A. Recent trends in Big Data ingestion tools: A study. In: Research in Intelligent and Computing in Engineering, Springer, 2021. Pp. 873–881. DOI: 10.1007/978-981-15-7527-3_83.</mixed-citation></ref><ref id="B6"><label>6.</label><mixed-citation>Shvachko K. HDFS scalability: The limits to growth. Login Usenix Mag. 2010. No. 35. Pp. 6–16.</mixed-citation></ref><ref id="B7"><label>7.</label><mixed-citation>White T. Hadoop: The definitive guide. 4 ed. O’Reilly Media, Inc., 2015. 754 p.</mixed-citation></ref></ref-list></back></article>
