DocumentCode :
2005516
Title :
Chinese Web-page Classification Study
Author :
Huang, Weitong ; Xu, Luxiong ; Duan, Junfeng ; Lu, Yuchang
Author_Institution :
Tsinghua Univ., Beijing
fYear :
2007
fDate :
May 30 2007-June 1 2007
Firstpage :
1553
Lastpage :
1558
Abstract :
In recent years, Web-page classification has been an active research topic in the field of Web mining, and research on Chinese Web-page classification has been growing. In this paper, we describe in detail a Chinese Web-page classification system that we implemented. Experiments show that our Web-page preprocessing and feature selection methods are effective, and the classification accuracy achieved on a Chinese Web-page dataset is satisfactory.
Keywords :
Internet; classification; data mining; Chinese Web-page classification; Web mining; Web page preprocessing; feature selection; Automatic control; Automation; Computer science; HTML; Search engines; Text categorization; Web pages; World Wide Web; Chinese web-page; web-page classification
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Control and Automation, 2007. ICCA 2007. IEEE International Conference on
Conference_Location :
Guangzhou
Print_ISBN :
978-1-4244-0817-7
Electronic_ISBN :
978-1-4244-0818-4
Type :
conf
DOI :
10.1109/ICCA.2007.4376621
Filename :
4376621