Title
A versatile data-intensive computing platform for information retrieval from big geospatial data.
Abstract
The increasing amount of free and open geospatial data of interest to major societal questions calls for the development of innovative data-intensive computing platforms for the efficient and effective extraction of information from these data. This paper proposes a versatile petabyte-scale platform based on commodity hardware and equipped with open-source software for the operating system, the distributed file system, and the task scheduler for batch processing as well as the containerization of user specific applications. Interactive visualization and processing based on deferred processing are also proposed. The versatility of the proposed platform is illustrated with a series of applications together with their performance metrics.
Year
DOI
Venue
2018
10.1016/j.future.2017.11.007
Future Generation Computer Systems
Field
DocType
Volume
Geospatial analysis,Distributed File System,Information retrieval,Data-intensive computing,Computer science,Interactive visualization,Software,Batch processing,Commodity hardware,Distributed computing
Journal
81
Issue
ISSN
Citations 
C
0167-739X
3
PageRank 
References 
Authors
0.61
10
7
Name
Order
Citations
PageRank
Pierre Soille130.95
A. Burger230.61
D. De Marchi330.61
Pieter Kempeneers47611.04
D. Rodriguez530.61
Vassilis Syrris630.61
V. Vasilev730.61