Title
Spatiotemporal air quality inference of low-cost sensor data: Evidence from multiple sensor testbeds
Abstract
Recent advances in sensor and IoT technologies allow for denser and mobile air quality measurements. These measurements are still spatiotemporally sparse at the city level, but can be interpolated using data-driven techniques. This work presents validation results of two machine-learning models that infer air quality sensor data in both space and time. Temporal validation exercises are performed at available regulatory monitoring stations following the FAIRMODE protocol. Both models prove scalable to different mobile datasets, with comparable prediction performance for PM2.5 (R2 = 0.68–0.75, MAE = 2.99–2.82 μg m−3) and NO2 (R2 = 0.8–0.82, MAE = 8.81–9.83 μg m−3) in Utrecht and Antwerp. In Oakland (Atlanta), we observed a lower performance for NO2 (R2 = 0.46–0.41, MAE = 4.06–5.07) and BC (R2 = 0.31–0.28, MAE = 0.48–0.27), likely caused by the less representative monitoring coverage. Although the two models are comparable in prediction performance, the Geographical Random Forest (GRF) model seems to achieve slightly better accuracies, while the correlations are typically higher for the Air Variational Graph Autoencoder (AVGAE) model. This work demonstrates the potential of data-driven techniques for spatiotemporal air quality inference of complementary sensor data. The observed performance metrics approach those of current state-of-the-art chemical transport models while requiring considerably less computational power, infrastructure, processing time and other resources.
Year
2022
DOI
10.1016/j.envsoft.2022.105306
Venue
Environmental Modelling & Software
Keywords
IoT, Urban, Air quality, Mobile, Sensors, Machine learning
DocType
Journal
Volume
149
ISSN
1364-8152
Citations
0
PageRank
0.34
References
0
Authors
7