Title :
Using a Semi-automatic Keyword Dictionary for Improving Violent Web Site Filtering
Author :
Guermazi, Radhouane ; Hammami, Mohamed ; Ben Hamadou, Abdelmajid
Author_Institution :
MIRACL-ISIMS, Tunis
Abstract :
The development of the Web has been paralleled by the emergence of harmful Web pages content such as pornography, violence, racism,etc.This emergence involved the necessity of providing filtering systems designed to secure the internet access. In this paper, we propose a violent Web content detection and filtering system called "WebAngels filter" which uses textual and structural content-based analysis. These analysis are based on a violent keyword dictionary. We focus our attention on the keyword dictionary preparation, and we demonstrate that a semi-automatic keyword dictionary can be used to improve the filtering efficiency of violent Web pages.
Keywords :
Web sites; dictionaries; information filtering; text analysis; Internet; WebAngels filter; harmful Web pages; pornography; racism; semiautomatic keyword dictionary; structural content-based analysis; textual content-based analysis; violence; violent Web content detection; violent Web content filtering system; violent Web site filtering; Algorithm design and analysis; Dictionaries; Information filtering; Information filters; Internet; Learning systems; Machine learning; Signal design; Web page design; Web pages; Web classification and categorization; Web textual and structural content; data-mining; n-grams; violent Web sites filtering;
Conference_Titel :
Signal-Image Technologies and Internet-Based System, 2007. SITIS '07. Third International IEEE Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3122-9
DOI :
10.1109/SITIS.2007.137