DocumentCode :
1957740
Title :
Chinese Automatic Documents Classification System
Author :
Li, Li-Rui ; Yang, Kai
Author_Institution :
Inf. Eng. Dept., Henan Vocational & Tech. Inst., Zhengzhou, China
Volume :
5
fYear :
2010
fDate :
9-11 July 2010
Firstpage :
324
Lastpage :
327
Abstract :
Chinese Web Automatic Document Classification is one of the core technologies in Chinese information retrieval. Web Spider technology is the key in Chinese WEB document automatic classification. this issue surrounds WEB information explore which is this cutting-edge research, combined with the overall requirements of the Chinese WEB Document Classification System Framework, achieving roaming of the network spiders on the Internet, and applying to improved algorithm of network spider which mainly solutes some of the problems encountered in the Chinese word search in the current information retrieval.
Keywords :
Internet; document handling; information retrieval; natural language processing; pattern classification; Chinese Web automatic document classification; Chinese information retrieval; Chinese word search; Internet; Web spider technology; Bayesian methods; Feature extraction; HTML; Chinese Automatic Classification; Link Tracking Method; Web Spider;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Information Technology (ICCSIT), 2010 3rd IEEE International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-5537-9
Type :
conf
DOI :
10.1109/ICCSIT.2010.5565018
Filename :
5565018
Link To Document :
بازگشت