PUKYONG

Enhanced R-tree Bulk Loading Scheme Using Map-Reduce Framework

Metadata Downloads
Abstract
An R-tree is an index structure that enables fast accesses to multi-dimensional data. Constructing an R-tree for a given data set yields an efficient R-tree structure than incrementally building one as data are inserted. However it usually takes a lot of time constructing an R-tree for a huge volume of data. In this paper, we propose a parallel R-Tree construction scheme based on a Hadoop framework. The proposed scheme divides the data into partitions, builds local R-trees in parallel, and merges them to construct an R-tree that covers whole data set. While generating the partitions, it considers the data distribution so that each partitions have nearly equal amount of data. Therefore the proposed scheme gives an efficient index structure while reducing the construction tim
Author(s)
HUYNH CONG VIET NGU
Issued Date
2017
Awarded Date
2017. 2
Type
Dissertation
Keyword
R-tree Bulk Map-Reduce Framework
Publisher
부경대학교 대학원
URI
https://repository.pknu.ac.kr:8443/handle/2021.oak/13514
http://pknu.dcollection.net/jsp/common/DcLoOrgPer.jsp?sItemId=000002333215
Affiliation
부경대학교 대학원
Department
대학원 IT융합응용공학과
Advisor
송하주
Table Of Contents
I. Introduction 1
II. Related Works 4
2.1 Big Data 4
2.1.1 Hadoop-MapReduce 4
2.2 Quality of R-tree 6
2.3 Parallel R-tree Construction using MapReduce 8
2.4 R-tree Packing Algorithm 9
2.4.1 Z-order Curve 10
2.4.2 Sort-Tile-Recursive 11
III. Parallel R-tree Construction using Hadoop 13
3.1 Overview 13
3.2 Data Partitioning 15
3.2.1 Description 15
3.2.2 Proposed MapReduce Algorithm 16
3.3 R-Tree Construction 20
3.3.1 Description 20
3.3.2 Proposed MapReduce Algorithm 20
3.4 R-tree Consolidation 21
IV. Experimental Result 22
4.1 Experimental Environment 22
4.1.1 Hadoop Cluster 22
4.1.2 Data Set 23
4.2 Experimental Result 25
4.2.1 Time Performance 25
4.2.1.1 Our Approach 25
4.2.1.2 Comparison with Z-curve Method 26
4.2.1.2.1 Data Partitioning Phase 27
4.2.1.2.2 R-tree Construction Phase 29
4.2.2 Quality of Generated R-tree 32
4.2.2.1 Area Comparison 32
4.2.2.1.1 Comparison with Single R-tree 32
4.2.2.1.2 Comparison with Z-curve Method 33
4.2.2.2 Overlap Comparison 36
4.2.2.2.1 Comparison with Z-curve Method 36
V. Conclusion 38
References 40
Acknowledgement 41
Degree
Master
Appears in Collections:
대학원 > IT융합응용공학과
Authorize & License
  • Authorize공개
Files in This Item:

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.