Title :
Online Classification with Partially Labelled Texts
Author :
Shirai, Mikiyasu ; Miura, Tsuyoshi
Author_Institution :
Dept..of Electr. & Electr. Eng., HOSEI Univ., Koganei, Japan
Abstract :
In this investigation, we propose a novel approach to document stream classification using both online topic model and partially labelled documents. Although we may have several features for the classification, it seems natural that these features may vary dynamically depending upon the contents of stream. This is because they depend heavily on each theme within one class while we should follow dynamic mixture of them. Especially in the stream of news articles, word frequency changes dramatically because of bursts of the themes. Here we propose a dynamically learning method based on topic models assuming prior distribution of probabilities over classes adjusted by partially labelled documents in stream.
Keywords :
Internet; learning (artificial intelligence); pattern classification; statistical distributions; text analysis; document stream classification; dynamically learning method; news article stream; online classification; online topic model; partially labelled documents; partially labelled texts; probability distribution; stream content; word frequency; Adaptation models; Context modeling; Data models; Feature extraction; Maximum likelihood estimation; Probability distribution; Resource management; adaptive classification; document stream; online topic model; partially labelled documents; topic-burst;
Conference_Titel :
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Warsaw
DOI :
10.1109/WI-IAT.2014.128