Parallel processing of very large databases using distributed column indexes
- 作者: Ivanova E.V.1, Sokolinsky L.B.1
-
隶属关系:
- South Ural State University
- 期: 卷 43, 编号 3 (2017)
- 页面: 131-144
- 栏目: Article
- URL: https://journals.rcsi.science/0361-7688/article/view/176501
- DOI: https://doi.org/10.1134/S0361768817030069
- ID: 176501
如何引用文章
详细
The development and investigation of efficient methods of parallel processing of very large databases using the columnar data representation designed for computer cluster is discussed. An approach that combines the advantages of relational and column-oriented DBMSs is proposed. A new type of distributed column indexes fragmented based on the domain-interval principle is introduced. The column indexes are auxiliary structures that are constantly stored in the distributed main memory of a computer cluster. To match the elements of a column index to the tuples of the original relation, surrogate keys are used. Resource hungry relational operations are performed on the corresponding column indexes rather than on the original relations of the database. As a result, a precomputation table is obtained. Using this table, the DBMS reconstructs the resulting relation. For basic relational operations on column indexes, methods for their parallel decomposition that do not require massive data exchanges between the processor nodes are proposed. This approach improves the class OLAP query performance by hundreds of times.
作者简介
E. Ivanova
South Ural State University
编辑信件的主要联系方式.
Email: Elena.Ivanova@susu.ru
俄罗斯联邦, Chelyabinsk, 454080
L. Sokolinsky
South Ural State University
Email: Elena.Ivanova@susu.ru
俄罗斯联邦, Chelyabinsk, 454080
补充文件
