Title
Clustering massive small data for IOT
Abstract
Data of IOT (Internet of things) have characteristics of heterogeneity, massive, timeliness and other features, which indicate that much of its data is in the form of small files. Cloud computing is used to deal with large data sets, but a large number of small data sets in the system will occupy most of the resources, resulting in a waste of system resources. In this paper, according to the characteristics of the mass of small data sets, and the deficiency of HDFS handle huge amounts of small data sets. This paper uses MapReduce to analysis the numerous small data sets and proposes a cluster strategy for massive small data based on the k-means clustering algorithm. The experimental results show that the proposed strategy can improve the data processing efficiency, and can improve the utilization of system resources. The research fruits will help us to design more practical merger strategy of massive small data to provide research reference.
Year
DOI
Venue
2014
10.1109/ICSAI.2014.7009427
ICSAI
Keywords
Field
DocType
internet of things,cloud computing,data analysis,parallel processing,pattern clustering,resource allocation,hdfs,iot,mapreduce,cluster strategy,data processing efficiency,data set analysis,k-means clustering algorithm,massive small data clustering,merger strategy,system resource utilization,clustering,k-means,clustering algorithms,algorithm design and analysis
Small files,Data mining,Data processing,Data set,Algorithm design,Small data,Computer science,Internet of Things,Cluster analysis,Cloud computing
Conference
Citations 
PageRank 
References 
2
0.40
7
Authors
2
Name
Order
Citations
PageRank
Xin Tao183.87
Chunlei Ji221.75