Title :
Incremental Learning for Text Document Classification
Author :
Chen, Zhihang ; Huang, Liping ; Murphey, Yi L.
Author_Institution :
Univ. of Michigan-Dearborn, Dearborn
Abstract :
This paper presents our research in incremental learning for text document classification. Incremental learning is important in text document classification since many applications have huge amount of training data, and training documents become available through time. We propose an incremental learning framework, ILTC(Incremental Learning of Text Classification) that involves the learning of features of text classes followed by an incremental Perceptron learning process. ILTC has the capabilities of incremental learning of new feature dimensions as well as new document classes. We applied the ILTC to a classification system of diagnostic text documents. The experiment results demonstrate that ILTC was able to incrementally learn new knowledge from newly available training data without either referring to the older training data or forgetting the already learnt knowledge.
Keywords :
classification; data mining; learning (artificial intelligence); perceptrons; text analysis; ILTC incremental learning framework; diagnostic text document classification system; incremental perceptron learning process; text mining; training data; Application software; Electronic mail; Natural languages; Neural networks; Organizing; Pattern analysis; Text categorization; Text mining; Training data; Web pages;
Conference_Titel :
Neural Networks, 2007. IJCNN 2007. International Joint Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4244-1379-9
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2007.4371367