Title
Mining for Norms in Clouds: Complying to Ethical Communication through Cloud Text Data Mining
Abstract
As the world realizes the power and efficiency of cloud computing, communication needs enhanced security and intelligence to filter out unethical data that violates norms in clouds. No categorization scheme for such filtering has yet been proposed. Numerous lists of banned, unethical, and objectionable words have been developed, with limited user satisfaction. These lists are usually generated manually, with some programmable extensibility for online forums and public newsgroups. We define a tool and methodology to categorize censored data. We statistically grow the word set in each category and tag hidden neutral words with their meaning in context. Using computational linguistics tools, modified to suit our purposes, we analyze sample text from gigabytes of email newsgroup data on cloud servers. The results present, in broad categories, a sample dataset of the most frequently used norm-breaking words in recent cloud communication. The categories separate cloud-server data found in newsgroups related to internet crimes, fraud, theft, anti-state elements, and other material of legal importance. The study thus demonstrates a tag cloud of the most frequent critical words in communications, from a legal and ethical point of view, in the current scenario of cloud databases.
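The abstract does not spell out the tool itself, so the following is only a minimal sketch of the general idea it describes: counting category words in messages and collecting co-occurring words as candidates for growing a category. The seed lexicon, the function names, and the sample messages are all hypothetical, and plain co-occurrence counting stands in for whatever statistical method the authors actually used.

```python
import re
from collections import Counter

# Hypothetical seed lexicon: the category names echo the abstract,
# but the words themselves are invented for illustration.
SEED_LEXICON = {
    "fraud": {"scam", "phishing"},
    "theft": {"stolen", "robbery"},
}

def tokenize(text):
    """Lowercase a message and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())

def category_frequencies(messages, lexicon):
    """Count, per category, how often each seed word occurs in the messages."""
    counts = {category: Counter() for category in lexicon}
    for message in messages:
        for token in tokenize(message):
            for category, words in lexicon.items():
                if token in words:
                    counts[category][token] += 1
    return counts

def cooccurring_candidates(messages, lexicon, category, top_n=5):
    """Rank non-seed words that share a message with a seed word of `category`."""
    seeds = lexicon[category]
    candidates = Counter()
    for message in messages:
        tokens = tokenize(message)
        if seeds.intersection(tokens):
            candidates.update(t for t in tokens if t not in seeds)
    return candidates.most_common(top_n)

# Invented sample messages, not taken from the paper's dataset.
messages = [
    "This phishing scam targets bank customers",
    "Stolen cards sold after the phishing attack",
    "Weather is nice today",
]
freqs = category_frequencies(messages, SEED_LEXICON)
print(freqs["fraud"].most_common())
print(cooccurring_candidates(messages, SEED_LEXICON, "fraud"))
```

The per-category counts could serve as tag cloud weights, and the co-occurring candidates would still need review before being added to a category, since many of them are ordinary function words.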
Year: 2012
DOI: 10.1109/UCC.2012.59
Venue: UCC
Keywords: cloud text data mining, tag cloud, ethical communication, public newsgroups, unethical data, email newsgroup dataset, cloud databases, sample result dataset, cloud computing, recent cloud communication, categories separate cloud-server data, legal importance, computational linguistics, ethical norms, text analysis, law, censored data, hidden markov model, security, censorship, categorical data, data mining
Field: Data mining, Categorization, World Wide Web, Norm (philosophy), Computer science, Gigabyte, Computational linguistics, Tag cloud, Hidden Markov model, Cloud computing, The Internet
DocType: Conference
ISSN: 2373-6860
Citations: 0
PageRank: 0.34
References: 3
Authors: 3
Name                     Order  Citations  PageRank
Ahsan Nabi Khan          1      0          0.34
Aslam Muhammad           2      22         9.31
A. M. Martinez-Enriquez  3      10         8.13