Title :
System of negative Indonesian website detection using TF-IDF and Vector Space Model
Author :
Adji, Teguh Bharata ; Abidin, Zainil ; Nugroho, Hanung Adi
Author_Institution :
Dept. of Electr. Enginering & Inf. Technol., Univ. Gadjah, Yogyakarta, Indonesia
Abstract :
Systems to filter negative (pornography) websites are widely established by several researchers. However, those systems are developed for English websites. There is a system to filter negative Indonesian website. However, it works based on URL database. This research developed negative Indonesian website filter which is based on content filtering using TF-IDF (Term Frequency-Inverse Document Frequency) and VSM (Vector Space Model). The accuracy of the system classification is 82.80%.
Keywords :
Web sites; content management; information filtering; information filters; English Web sites; TF-IDF; URL database; VSM; content filtering; negative Indonesian Web site detection; negative Indonesian Web site filter; negative Web sites filter; pornography Web sites; system classification; term frequency inverse document frequency; vector space model; Accuracy; Classification algorithms; Information filters; Support vector machines; Uniform resource locators; TF-IDF; Vector Space Model; classification; pornography;
Conference_Titel :
Electrical Engineering and Computer Science (ICEECS), 2014 International Conference on
Print_ISBN :
978-1-4799-8477-0
DOI :
10.1109/ICEECS.2014.7045240