Title
Living on the Edge: Efficient Handling of Large Scale Sensor Data
Abstract
Real-time sensor monitoring is critical in many industrial applications and is, e.g., used to model and predict operating conditions to optimize operations as well as to prevent damage in machinery and systems. In many cases, this data is generated by a myriad of sensors and stored or transmitted for post-processing by data analysts. Handling this data near its origin-on the edge-imposes significant challenges for storage and compression: it is necessary to store it in a format that is suitable for large data analytics algorithms, which in most cases means columnar storage. Furthermore, to provide efficient storage and transmission of such sensor data, it must be compressed efficiently. However, existing solutions do not address these challenges sufficiently. In this work, we present a holistic approach for fast streaming of large scale sensor data directly into columnar storage and integrate it with a proven compression scheme. Our approach uses a pipelined scheme for streaming and transposing the data layout, combined with a byte-level transformation of data representation and compression, which we evaluate in comprehensive experiments. As a result, our approach enables transformation of large scale sensor data streams into an efficient, analytics-friendly format already at the sensor site, i.e., on the edge, at data ingestion time. By implementing our optimized approach in the open and widely used columnar storage format Apache Parquet, which we already partly upstreamed, we ensure its accessibility to the community.
Year
DOI
Venue
2021
10.1109/CCGrid51090.2021.00010
2021 IEEE/ACM 21st International Symposium on Cluster, Cloud and Internet Computing (CCGrid)
Keywords
DocType
ISBN
sensor data streaming,edge computing
Conference
978-1-7281-9587-2
Citations 
PageRank 
References 
1
0.38
0
Authors
6
Name
Order
Citations
PageRank
Roman Karlstetter110.38
Amir Raoofy210.38
Martin Radev310.38
Carsten Trinitis415129.80
Jakob Hermann510.38
Martin Schulz616719.77