DocumentCode
2774918
Title
Expert-Driven Topical Classification of Short Message Streams
Author
Kamath, Krishna Y. ; Caverlee, James
Author_Institution
Dept. of Comput. Sci., Texas A&M Univ., College Station, TX, USA
fYear
2011
fDate
9-11 Oct. 2011
Firstpage
388
Lastpage
393
Abstract
We study the problem of expert-driven topical classification of short messages in time-evolving streams like Face book status updates, Twitter messages, and SMS communication. While high-level topics in these streams may be fixed (e.g., Sports, News), the content associated with these topics is typically less static, reflecting temporal change in interest as these streams evolve (e.g., tweets about the Olympics wane, while tweets about the World Cup rise in popularity). Coupled with this rapid concept drift, short messages themselves provide little contextual information and result in sparse features for effective classification. With these challenges in mind, we present an expert-driven framework for time-aware topical classification framework of short messages. Three of the salient features of the framework are (i) a novel expert-centric classifier, (ii) a sliding-window training for adaptive topical classification, and (iii) a suite of enrichment-based methods (lexical, link, collocation) for overcoming feature sparsity in short messages.
Keywords
electronic messaging; pattern classification; social networking (online); Facebook status updates; SMS communication; Twitter messages; enrichment based methods; expert centric classifier; expert driven topical classification; feature sparsity; rapid concept drift; short message streams; sliding window training; time aware topical classification; time evolving streams; Entropy; Feature extraction; Frequency measurement; Medical services; Training; Twitter; classification; short-text; social media;
fLanguage
English
Publisher
ieee
Conference_Titel
Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), 2011 IEEE Third International Conference on
Conference_Location
Boston, MA
Print_ISBN
978-1-4577-1931-8
Type
conf
DOI
10.1109/PASSAT/SocialCom.2011.213
Filename
6113139
Link To Document