Issue |
JNWPU
Volume 39, Number 4, August 2021
|
|
---|---|---|
Page(s) | 909 - 918 | |
DOI | https://doi.org/10.1051/jnwpu/20213940909 | |
Published online | 23 September 2021 |
Optimization of correlate subquery based on distributed database
基于分布式数据库的相关子查询优化
1
School of Computer Science, Northwestern Polytechnical University, Xi’an 710072, China
2
Bank of Communications, Shanghai 200120, China
Received:
25
November
2020
Subquery is widely used in database. It can be divided into related subquery and non-related subquery according to whether it is dependent on the table of the parent query. For related subqueries, it is necessary to take a tuple from the parent query before executing the subquery, that is, the content of the subquery needs to be repeatedly operated. Disk access costs of this strategy is very big, in the distributed database, because of data communication overhead, in the parent query yuan set is too low efficiency, therefore, for the class sub queries, on the basis of the optimization of the existing query strategy, combining with the characteristics of distributed database, put forward by the subquery on to join queries, eliminate redundant clauses in the subquery, eliminate accumulation function method based on distributed database query optimization strategy, and the effectiveness of the present optimization strategy is verified by experiment.
摘要
子查询在数据库中的应用较为广泛,根据是否与父查询的表有依赖关系,可以将其分为相关子查询和非相关子查询。相关子查询需要先从父查询中取出一个元组后执行子查询,即需要反复对子查询的内容进行运算。这种策略的磁盘访问开销很大,在分布式数据库中,由于存在数据通信开销,在父查询元组过多时效率较低。针对该类子查询,在现有的优化查询策略的基础上,结合分布式数据库的特点,提出了通过子查询上拉为连接查询,消除子查询中冗余子句,消除聚集函数等方法实现的基于分布式数据库的子查询优化策略,并通过实验验证了所提优化策略的有效性。
Key words: Distributed database / Correlate subquery optimization / rule-based optimization
关键字 : 分布式数据库 / 相关子查询优化 / 基于规则的优化
© 2021 Journal of Northwestern Polytechnical University. All rights reserved.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.