Abstract | ||
---|---|---|
In recent years, due to the frequent occurrence of extreme rainfall events, the intensity of rainfall characteristics in Taiwan has been changed. In the event of heavy rainfall, it is worth our attention for a long time. Through the big data analysis platform of Apache Spark and R language, this paper uses the random forest algorithm to construct the regional rainfall patterns. in order to obtain the meteorological information on the Internet, datasets are retrieved from the Pinglin Weather Station of Central Meteorological Administration of Taiwan's Ministry of Communications. The meteorological data such as rainfall, temperature and humidity at the weather station will be crawled from the web first. After pre-processing meteorological data, this paper applies random forest algorithm on the platform of Apache Spark to calculate and analyze the results. From simulation results, the obtained root mean square error of test data for the platform Apache Spark has better results than traditional platform. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1109/ICACI.2018.8377482 | 2018 Tenth International Conference on Advanced Computational Intelligence (ICACI) |
Keywords | Field | DocType |
big data,spark,random forest,decision tree | Meteorology,Spark (mathematics),Weather station,Mean squared error,Environmental science,Test data,Landslide,Random forest,Big data,Precipitation | Conference |
ISBN | Citations | PageRank |
978-1-5386-4363-1 | 0 | 0.34 |
References | Authors | |
3 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Chou-Yuan Lee | 1 | 295 | 17.36 |
Jian-Qiong Huang | 2 | 0 | 0.34 |
Wei-Ping Ma | 3 | 0 | 0.34 |
Yulin Weng | 4 | 4 | 1.46 |
Yuan-Chih Lee | 5 | 0 | 0.34 |
Zne-Jung Lee | 6 | 940 | 43.45 |