Abstract | ||
---|---|---|
Santa Ana conditions are hot, dry, windy weather conditions that can greatly increase the dangers of wildfires in southern California. We present a machine learning approach to detect Santa Ana conditions based on sensor measurements from weather stations. Cluster analysis is performed on historical weather data to build models to identify Santa Ana patterns. A separate model is built using data from each weather station to capture the patterns specific to the micro-climate of each region. Real-time sensor data from a weather station can then be processed to determine if the region surrounding that station is experiencing Santa Ana conditions. Results can be used as a warning system to focus firefighting efforts on regions with increased wildfire risks. Through the use of the Kepler workflow system and distributed computing with Spark, data from several weather stations can be processed in parallel using a scalable clustering algorithm, allowing our approach to scale to large datasets from multiple weather stations. |
Year | Venue | Keywords |
---|---|---|
2016 | 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | wildfire monitoring, k-means clustering, scientific workflow, distributed computing, location-specific sensor processing |
Field | DocType | Citations |
Meteorology,Warning system,Data mining,Data modeling,Spark (mathematics),Computer science,Simulation,Weather station,Microclimate,Firefighting,Cluster analysis,Scalability | Conference | 0 |
PageRank | References | Authors |
0.34 | 0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Mai Nguyen | 1 | 11 | 1.63 |
Dylan Uys | 2 | 0 | 0.34 |
Daniel Crawl | 3 | 243 | 21.02 |
Charles Cowart | 4 | 0 | 0.34 |
Ilkay Altintas | 5 | 1191 | 106.09 |