DocumentCode :
124256
Title :
Online Classification with Partially Labelled Texts
Author :
Shirai, Mikiyasu ; Miura, Tsuyoshi
Author_Institution :
Dept..of Electr. & Electr. Eng., HOSEI Univ., Koganei, Japan
Volume :
2
fYear :
2014
fDate :
11-14 Aug. 2014
Firstpage :
421
Lastpage :
428
Abstract :
In this investigation, we propose a novel approach to document stream classification using both online topic model and partially labelled documents. Although we may have several features for the classification, it seems natural that these features may vary dynamically depending upon the contents of stream. This is because they depend heavily on each theme within one class while we should follow dynamic mixture of them. Especially in the stream of news articles, word frequency changes dramatically because of bursts of the themes. Here we propose a dynamically learning method based on topic models assuming prior distribution of probabilities over classes adjusted by partially labelled documents in stream.
Keywords :
Internet; learning (artificial intelligence); pattern classification; statistical distributions; text analysis; document stream classification; dynamically learning method; news article stream; online classification; online topic model; partially labelled documents; partially labelled texts; probability distribution; stream content; word frequency; Adaptation models; Context modeling; Data models; Feature extraction; Maximum likelihood estimation; Probability distribution; Resource management; adaptive classification; document stream; online topic model; partially labelled documents; topic-burst;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Warsaw
Type :
conf
DOI :
10.1109/WI-IAT.2014.128
Filename :
6927655
Link To Document :
بازگشت