Title :
Smart Saint: An Active Semi-supervised Learning Internet Filter
Author :
Vargas Rigo, Felipe ; Nicacio Maraes, Murillo ; Takashi Matsubara, Edson
Author_Institution :
Fac. de Comput., Univ. Fed. de Mato Grosso do Sul, Campo Grande, Brazil
Abstract :
The Internet contains potentially harmful or inappropriate content in web pages that parents may not wish their children to access. A possible solution is an internet content filter, to block prohibitive content. This paper proposes a system called SMART SAINT, which can deliver high accuracy classifiers using a welcome combination of active semi-supervised learning and feature selection. The system proposes a combination of co-testing as an active semi-supervised learning method and Binormal Separation (BNS) as a feature selection method. We empirically evaluate the core implementation of the system on a real world dataset of approximately 10,000 web pages and results indicate that the combination is highly effective.
Keywords :
Web sites; classification; content management; feature selection; information filtering; learning (artificial intelligence); BNS; Internet content filter; Smart Saint; Web pages; active semisupervised learning; binormal separation; cotesting; feature selection; high accuracy classifiers; prohibitive content; Games; Internet; Measurement; Proposals; Semisupervised learning; Support vector machines; Training; active learning; machine learning; semi-supervised learning;
Conference_Titel :
Intelligent Systems (BRACIS), 2013 Brazilian Conference on
Conference_Location :
Fortaleza
DOI :
10.1109/BRACIS.2013.31