DocumentCode :
3739257
Title :
Text Classification Models for Web Content Filtering and Online Safety
Author :
Shuhua Liu;Thomas Forss
Author_Institution :
Dept. of Bus. Manage. &
fYear :
2015
Firstpage :
961
Lastpage :
968
Abstract :
Living in an era of anywhere anytime connectedness for the great mass, safety and security on the web presents enormous challenges. There is a great need for better content detection systems that can more accurately identify excessively offensive and harmful websites. Web classification models in the early days are limited by the methods and data available. Today advanced developments in computing methodologies and technology have brought us many new and better means for text content analysis, for example new methods for topic extraction, topic modeling and sentiment analysis. Our recent studies suggested the promising potential of combing topic analysis and sentiment analysis in web content classification. This paper further explores new classification models for better classification performance, especially to enhance precision and reduce false positives, by incorporation of semantics in developing classification models and by examination and handling of the issues with the dataset reliability, class imbalance and covariate shift.
Keywords :
"Web pages","Feature extraction","Semantics","Sentiment analysis","Computational modeling","Analytical models","Data mining"
Publisher :
ieee
Conference_Titel :
Data Mining Workshop (ICDMW), 2015 IEEE International Conference on
Electronic_ISBN :
2375-9259
Type :
conf
DOI :
10.1109/ICDMW.2015.143
Filename :
7395771
Link To Document :
بازگشت