Title
BIG DATA DEVELOPMENT PLATFORM FOR ENGINEERING APPLICATIONS
Abstract
This study uses VirtualBox virtualization to build a personal, small-scale Big Data platform, VM Hadoop, that replicates a Hadoop system inside a virtual machine and gives developers an environment in which to design and implement Hadoop Map/Reduce programs easily. The study also benchmarks three systems: the VM Hadoop, a small-cluster Hadoop, and Braavos, NCHC’s large-scale Hadoop cluster. The benchmark results show that the VM Hadoop is an ideal platform for developing and testing Map/Reduce code, while the Braavos Hadoop cluster is the most appropriate for production runs. On the standard WordCount example, the Braavos Hadoop cluster is 232 times faster than the small-cluster Hadoop. In addition, an engineering example, image recognition for flow monitoring, illustrates how big image data can be analyzed in the Hadoop system. Finally, the VM Hadoop, as a Big Data development platform, is ready for users to download. The first author of this paper would like to demonstrate the proposed VM Hadoop system together with an engineering application.
Year
2016
Venue
BigData
Keywords
Big data, Hadoop, In-place Computation, MapReduce, Personal Platform, Engineering Application
Field
Data mining, Virtual machine, Data analysis, Computer science, Big data, Benchmark (computing), Operating system, Code development, Virtual machining
DocType
Conference
Citations
2
PageRank
0.77
References
2
Authors
6
Name              Order  Citations  PageRank
Chien-Heng Wu     1      3          1.82
Whey-Fone Tsai    2      4          3.85
Franco Lin        3      4          2.16
Wen-Yi Chang      4      12         3.34
Hsi-ching Lin     5      2          1.45
Chao-Tung Yang    6      1196       139.50