Title
High-Performance Overlay Analysis of Massive Geographic Polygons That Considers Shape Complexity in a Cloud Environment.
Abstract
Overlay analysis is a common task in geographic computing that is widely used in geographic information systems, computer graphics, and computer science. With the breakthroughs in Earth observation technologies, particularly the emergence of high-resolution satellite remote-sensing technology, geographic data have demonstrated explosive growth. The overlay analysis of massive and complex geographic data has become a computationally intensive task. Distributed parallel processing in a cloud environment provides an efficient solution to this problem. The cloud computing paradigm represented by Spark has become the standard for massive data processing in the industry and academia due to its large-scale and low-latency characteristics. The cloud computing paradigm has attracted further attention for the purpose of solving the overlay analysis of massive data. These studies mainly focus on how to implement parallel overlay analysis in a cloud computing paradigm but pay less attention to the impact of spatial data graphics complexity on parallel computing efficiency, especially the data skew caused by the difference in the graphic complexity. Geographic polygons often have complex graphical structures, such as many vertices, composite structures including holes and islands. When the Spark paradigm is used to solve the overlay analysis of massive geographic polygons, its calculation efficiency is closely related to factors such as data organization and algorithm design. Considering the influence of the shape complexity of polygons on the performance of overlay analysis, we design and implement a parallel processing algorithm based on the Spark paradigm in this paper. Based on the analysis of the shape complexity of polygons, the overlay analysis speed is improved via reasonable data partition, distributed spatial index, a minimum boundary rectangular filter and other optimization processes, and the high speed and parallel efficiency are maintained.
Year
DOI
Venue
2019
10.3390/ijgi8070290
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION
Keywords
Field
DocType
overlay analysis,shape complexity,massive data,cloud,parallel computing
Spatial analysis,Graphics,Geographic information system,Polygon,Spark (mathematics),Computer science,Computer graphics,Spatial database,Distributed computing,Cloud computing
Journal
Volume
Issue
ISSN
8
7
2220-9964
Citations 
PageRank 
References 
1
0.36
0
Authors
6
Name
Order
Citations
PageRank
Kang Zhao110.36
Baoxuan Jin210.36
Hong Fan3114.96
Weiwei Song4426.05
Sunyu Zhou510.36
Yuanyi Jiang610.36