Title :
A research on Web resources automatic classification using SVMs
Author :
Wei, Cai ; Yongcheng, Wang ; Zhonghang, Yin ; Tao, Zou
Author_Institution :
Shanghai Jiao Tong Univ., China
Abstract :
With the rapid growth of Web information, text categorization has become an important research field for the management of Internet information. Most of the existing methods are based on traditional statistics, which provides a conclusion only for the situation where sample size is tending to infinity. So they may not work well in practical case with limited samples and easily lead to the problem of over-fitting. These papers theoretical analyze the reason of over-fitting and introduce its condition as well as the method to solve it. We introduce SVMs, a method to avoid over-fitting, which is based on statistical learning theory matching the limited number of Internet news examples.
Keywords :
Bayes methods; Internet; decision theory; learning (artificial intelligence); learning automata; text analysis; Internet information management; SVMs; Web information; Web resources; automatic classification; statistical learning theory; support vector machines; text categorization; H infinity control; Information filtering; Information filters; Information management; Resource management; Statistical learning; Statistics; Support vector machine classification; Support vector machines; Text categorization;
Conference_Titel :
Intelligent Control and Automation, 2002. Proceedings of the 4th World Congress on
Print_ISBN :
0-7803-7268-9
DOI :
10.1109/WCICA.2002.1020803