DocumentCode :
3146350
Title :
Mining for Norms in Clouds: Complying to Ethical Communication through Cloud Text Data Mining
Author :
Khan, A. Nayeemulla ; Muhammad, Ajmal ; Martinez Enriquez, A.M.
Author_Institution :
Dept. of CS, Nat. Univ. of Comput. & Emerging Sci., Lahore, Pakistan
fYear :
2012
fDate :
5-8 Nov. 2012
Firstpage :
327
Lastpage :
332
Abstract :
As the world realizes the power and efficiency of cloud computing, enhanced security and intelligence are needed in communication to filter out unethical data that violates norms in clouds. No categorization scheme for such filtering has yet been proposed. Numerous lists of banned, unethical, and objectionable words have been developed, with limited user satisfaction. These lists are usually generated manually, with some programmable extensibility for online forums and public newsgroups. We define a tool and methodology to categorize the censor data. We statistically grow the set of words in the categorized data and tag hidden neutral words with their meaning in context. Using computational linguistics tools, modified to suit our purposes, we analyze sample text from gigabytes of an email newsgroup dataset on cloud servers. We present, in broad categories, a sample result dataset of the words most frequently used to break norms in recent cloud communication. The categories separate cloud-server data found in newsgroups related to Internet crime, fraud, theft, anti-state elements, and other material of legal importance. The study thus presents a tag cloud of the most frequent critical words in communications, from a legal and ethical point of view, in the current scenario of cloud databases.
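The word-growing and frequency-counting steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' tool: the category names, seed words, and function names below are hypothetical, and simple same-message co-occurrence stands in for whatever statistical model the paper actually uses (the keywords mention a Hidden Markov Model, which this sketch does not implement).

```python
import re
from collections import Counter

# Hypothetical seed lists; the paper's real categories and word lists
# are not given in this record.
SEED_WORDS = {
    "fraud": {"scam", "phishing"},
    "theft": {"steal", "stolen"},
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())


def category_frequencies(messages, seeds=SEED_WORDS):
    """Count how often each seed word occurs, grouped by category."""
    counts = {cat: Counter() for cat in seeds}
    for msg in messages:
        for tok in tokenize(msg):
            for cat, words in seeds.items():
                if tok in words:
                    counts[cat][tok] += 1
    return counts


def grow_candidates(messages, seeds=SEED_WORDS, min_count=2):
    """Suggest non-seed words that share a message with a category's seeds.

    A word is a candidate for a category if it appears in at least
    `min_count` messages that also contain one of that category's seeds.
    """
    all_seeds = set().union(*seeds.values())
    cooc = {cat: Counter() for cat in seeds}
    for msg in messages:
        toks = set(tokenize(msg))
        for cat, words in seeds.items():
            if toks & words:
                cooc[cat].update(toks - all_seeds)
    return {
        cat: [w for w, n in counter.most_common() if n >= min_count]
        for cat, counter in cooc.items()
    }
```

The per-category counts from `category_frequencies` are the kind of data a tag cloud would be drawn from, while `grow_candidates` proposes neutral-looking words for manual review. The sketch has no stop-word filtering, so common function words would need to be removed before use on real newsgroup text.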
Keywords :
cloud computing; computational linguistics; data mining; ethical aspects; fraud; information filtering; information resources; law; security of data; text analysis; Internet crime; antistate element; banned words; censor data; cloud communication; cloud computing; cloud database; cloud server; cloud text data mining; cloud-server data; computational linguistics tools; critical words; data categorization; email newsgroup dataset; ethical communication; ethical point-of-view; filtering categorization; fraud; hidden neutral words; intelligence; legal importance; legal point-of-view; norm mining; objectionable words; online forum; public newsgroup; security; text analysis; theft; unethical data violating norms; unethical words; user satisfaction; Cloud computing; Data mining; Law; Security; Servers; Tag clouds; Censorship; Cloud Servers; Ethical Norms; Hidden Markov Model; Security; Tag Cloud; Text Data Mining;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Utility and Cloud Computing (UCC), 2012 IEEE Fifth International Conference on
Conference_Location :
Chicago, IL
Print_ISBN :
978-1-4673-4432-6
Type :
conf
DOI :
10.1109/UCC.2012.59
Filename :
6424968