Title :
Online classifiers for Chinese text classification and filtering
Author :
Guo, Yanhui ; Liu, Jianyi ; Wang, Cong ; Zhong, Yixin
Author_Institution :
Res. Center of Intelligence Eng. Dept., Beijing Univ. of Posts & Telecommun., China
Abstract :
In text classification systems, the quality and quantity of the training document is one of the most import factors which affect performance. But gathering, filtering and classifying the training documents is very difficult. This paper employs feedback mechanism for text classification systems and reduces the need for labeled training documents by unifying the strengths of k-NN and linear classifiers. It collects various types of example documents provided by the users and set up the user profiles. While the documents from stream are matched with the profiles, the relevant documents are used to improve the user profile. The online approach offers the advantage of continuous learning in the batch-adaptive text considerations, on the batch-adaptive filtering task.
Keywords :
Internet; classification; information filters; learning (artificial intelligence); natural languages; text analysis; Chinese text classification systems; batch-adaptive filtering task; batch-adaptive text considerations; k-NN classifier; labeled training documents; linear classifier; online classifier; text filtering; Electronic equipment testing; Electronic mail; Feedback; Information filtering; Information filters; Marketing and sales; Postal services; Text categorization; Web pages; Web sites;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7803-7902-0
DOI :
10.1109/NLPKE.2003.1275988