DocumentCode
2642879
Title
Identifying vulgar content in eMule network through text classification
Author
Liu, Xiangtao ; Cheng, Xueqi ; Li, Jingyuan ; Zhai, Haijun ; Bai, Shuo
Author_Institution
Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
fYear
2010
fDate
23-26 May 2010
Firstpage
168
Lastpage
168
Abstract
Through years of development, the cyberspace has been dominated by traffic of peer-to-peer (P2P) file sharing applications. Among them, eMule is especially favored by millions of P2P users all over the world. However, it is very difficult to manage the content which is delivered through eMule due to its distributed property, thus a large number of vulgar content (e.g., pornographic and violent files) is existing in eMule. Since children and adolescents are the main force of eMule users, it is quite necessary to provide an efficient method to identify and filter the vulgar content for the sake of innocent children and adolescents. In this study, an automatic framework based on text classification is proposed to identify and filter vulgar content in eMule. Filename is used as the feature to carry out the elementary research on the effectiveness of our framework, although filename may be changed freely by eMule users. We aim to achieve high accuracy when identifying and filtering vulgar content, thus to raise the quality of the content delivered in eMule to a higher level.
Keywords
Assembly; Computers; Content management; Crawlers; Filtering; Filters; Peer to peer computing; Search engines; Spatial databases; Text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligence and Security Informatics (ISI), 2010 IEEE International Conference on
Conference_Location
Vancouver, BC, Canada
Print_ISBN
978-1-4244-6444-9
Type
conf
DOI
10.1109/ISI.2010.5484751
Filename
5484751
Link To Document