Improve Search Performance for R+ tree Learned Spatial Index
DOI:
https://doi.org/10.25098/7.1.3Keywords:
Learned index, R tree, nearest neighbor spatial queryAbstract
The R+ tree algorithm is used to give index to each spatial data sets of (point, line polygon spatial objects). Our map datasets contain 1048575 records for point, as well as for line and polygon each. Machine learning approach used to learn the indices. Then nearest neighbor queries (NNQ) are carried out in purpose of evaluating system performance for proposed learned index. The evaluation of proposed indexing done based on query execution time. Execution times of k-Nearest neighbor query in learned methods for all kind of spatial objects (point, line, and polygon) are calculated, that 25 random sample points were took for each of them. In order to compare learnt and conventional indexing, we also developed the traditional indexing using the R+ tree technique. So, a significant finding that goes beyond the suggested approach (R+ Learned Spatial Index (R+ LSI)) is that we reached less execution times in learned index method for nearest neighbor queries for all three spatial objects. Getting benefits of Microsoft SQL Server database for handling spatial objects indices. We propose that machine learning (ML) allows for the independent creation of specific index structures by making it possible to design a model that reproduce the patterns in the data. We explore the extent to which learning models may replace traditional index structures like R+ tree. Finally measuring the learned index (R+ LSI) is done by getting benefits of both parameters which are, mean square error (MSE) and R-square (R2). The estimator is linear regression which is used in machine learning approach to make a model. Enhancing spatial query performance is done through the minimum execution time of spatial NNQ in this research.
References
Güting, R.H., An introduction to spatial database systems. the VLDB Journal, 1994. 3(4): p. 357-399.
Jia, L., et al., Efficient 3D Hilbert Curve Encoding and Decoding Algorithms. Chinese Journal of Electronics, 2022. 31(2): p. 277-284.
Comer, D., Ubiquitous B-tree. ACM Computing Surveys (CSUR), 1979. 11(2): p. 121-137.
Groh, F., et al., Ggnn: Graph-based gpu nearest neighbor search. IEEE Transactions on Big Data, 2022.
Liu, Y., et al., Research on Hybrid Index Based on 3D Multi-Level Adaptive Grid and R+ Tree. IEEE Access, 2021. 9: p. 146010-146022.
Gu, T., et al., A Reinforcement Learning Based R-Tree for Spatial Data Indexing in Dynamic Environments. arXiv preprint arXiv:2103.04541, 2021.
Sellis, T., N. Roussopoulos, and C. Faloutsos, The R+-Tree: A Dynamic Index for Multi-Dimensional Objects. 1987.
Böhm, C., S. Berchtold, and D.A. Keim, Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases. ACM Computing Surveys (CSUR), 2001. 33(3): p. 322-373.
Samet, H., The quadtree and related hierarchical data structures. ACM Computing Surveys (CSUR), 1984. 16(2): p. 187-260.
Rosenberg, J.B., Geographical data structures compared: A study of data structures supporting region queries. IEEE transactions on computer-aided design of integrated circuits and systems, 1985. 4(1): p. 53-67.
Hinrichs, K. and J. Nievergelt, The grid file: a data structure designed to support proximity queries on spatial objects. ETH Eidgenössische Technische Hochschule Zürich, Institut für Informatik, 1983. 54.
Kamel, I. and C. Faloutsos. On packing R-trees. in Proceedings of the second international conference on Information and knowledge management. 1993.
Kamel, I. and C. Faloutsos, Hilbert R-tree: An improved R-tree using fractals. 1993.
Zhang, L., et al., Query method for nearest region of spatial line segment based on Hilbert curve grid. International Journal of Innovative Computing Information and Control, 2019. 15(4): p. 1287-1307.
Triebel, R., P. Pfaff, and W. Burgard. Multi-level surface maps for outdoor terrain mapping and loop closing. in 2006 IEEE/RSJ international conference on intelligent robots and systems. 2006. IEEE.
Zhang, H., Y. Dong, and D. Xu, Accelerating exact nearest neighbor search in high dimensional Euclidean space via block vectors. International Journal of Intelligent Systems, 2022. 37(2): p. 1697-1722.
Tian, Y., et al., A Learned Index for Exact Similarity Search in Metric Spaces. arXiv preprint arXiv:2204.10028, 2022.
Guttman, A. R-trees: A dynamic index structure for spatial searching. in Proceedings of the 1984 ACM SIGMOD international conference on Management of data. 1984.
Nathan, V., et al. Learning multi-dimensional indexes. in Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data. 2020.
Ding, J., et al. ALEX: an updatable adaptive learned index. in Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data. 2020.
Janiesch, C., P. Zschech, and K. Heinrich, Machine learning and deep learning. Electronic Markets, 2021. 31(3): p. 685-695.
Kraska, T., et al. The case for learned index structures. in Proceedings of the 2018 international conference on management of data. 2018.
Kang, M.-A. and K.-J. Li. Query processing methods for connectivity search in visual databases using R+-tree. in Working Conference on Visual Database Systems. 1995. Springer.
Singh, H. and S. Bawa, A survey of traditional and mapreducebased spatial query processing approaches. ACM SIGMOD Record, 2017. 46(2): p. 18-29.
Najeeb, G.M. and N.A. Ali, In the Context of Spatial Data, a Comparison between Learning and Traditional Indexing. Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2022. 54(5): p. 1-8.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 The Scientific Journal of Cihan University– Sulaimaniya

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
SJCUS's open access articles are published under a Creative Commons Attribution CC-BY-NC-ND 4.0 license.
