Title :
Incremental learning of aspect model on streaming documents
Author :
Chang, Te-Min ; Hsiao, Wen-Feng ; Wu, Cheng-Wei
Author_Institution :
Dept. of Inf. Manage., Nat. Sun Yat-sen Univ., Kaohsiung, Taiwan
fDate :
Nov. 30 2010-Dec. 2 2010
Abstract :
This research is to propose an IR related technique, the incremental aspect model (ISM), which not only uncovers latent aspects from the collected documents but also adapts the aspect model on streaming documents chronologically. ISM includes two stages: in Stage I, probabilistic latent semantic indexing (PLSI) technique is used to build a primary aspect model; and in Stage II, with out-of-date data removing and new data folding-in, the aspect model can be expanded using the derived spectral method if new aspects significantly exist. Two experiments on text clustering tasks are conducted accordingly. Results show the ISM has robust performance in terms of its incremental learning ability.
Keywords :
document handling; indexing; learning (artificial intelligence); pattern clustering; task analysis; IR related technique; ISM; data folding; document streaming; incremental aspect model; incremental learning ability; probabilistic latent semantic indexing technique; spectral method; text clustering task; Buildings; Convergence; Data models; Estimation; Indexing; Large scale integration; Semantics; Aspect Model; Incremental Learning; Probabilistic Latent Semantic Indexing; Text Clustering;
Conference_Titel :
Computer Sciences and Convergence Information Technology (ICCIT), 2010 5th International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4244-8567-3
Electronic_ISBN :
978-89-88678-30-5
DOI :
10.1109/ICCIT.2010.5711084