Title
Cloud Based Predictive Analytics: Text Classification, Recommender Systems and Decision Support
Abstract
This paper presents a detailed study of technologies based on Hadoop and MapReduce available over the cloud for large-scale data mining and predictive analytics. Although some studies may have shown that cloud technologies relying on the MapReduce framework do not perform as well as parallel database management systems, e.g., with ad hoc queries and interactive applications, MapReduce has still been widely used by many organizations for big data storage and analytics. A number of MapReduce based tools are broadly available over the cloud. In this work we explore the Apache Hive data warehousing solution and particularly its Mahout data mining libraries for predictive analytics. We present results in the context of text classification, recommender systems and decision support. We develop prototype tools in these areas and discuss our outcomes from the study useful to researchers and other professionals in cloud computing and application domains. To the best of our knowledge, ours is among the first few in-depth studies on Mahout with application prototypes available for use.
Year
DOI
Venue
2013
10.1109/ICDMW.2013.95
Data Mining Workshops
Keywords
Field
DocType
application domain,cloud technology,cloud computing,mahout data mining library,text classification,decision support,apache hive data warehousing,detailed study,predictive analytics,recommender systems,mapreduce framework,large-scale data mining,big data storage,data mining,data warehouses,text analysis,decision support systems
Recommender system,Data science,Data warehouse,Data mining,Predictive analytics,Computer science,Decision support system,Analytics,Business intelligence,Big data,Cloud computing
Conference
ISSN
ISBN
Citations 
2375-9232
978-1-4799-3143-9
5
PageRank 
References 
Authors
0.43
6
2
Name
Order
Citations
PageRank
Klavdiya Hammond150.43
Aparna S. Varde218828.71