• DocumentCode
    2774918
  • Title

    Expert-Driven Topical Classification of Short Message Streams

  • Author

    Kamath, Krishna Y. ; Caverlee, James

  • Author_Institution
    Dept. of Comput. Sci., Texas A&M Univ., College Station, TX, USA
  • fYear
    2011
  • fDate
    9-11 Oct. 2011
  • Firstpage
    388
  • Lastpage
    393
  • Abstract
    We study the problem of expert-driven topical classification of short messages in time-evolving streams like Face book status updates, Twitter messages, and SMS communication. While high-level topics in these streams may be fixed (e.g., Sports, News), the content associated with these topics is typically less static, reflecting temporal change in interest as these streams evolve (e.g., tweets about the Olympics wane, while tweets about the World Cup rise in popularity). Coupled with this rapid concept drift, short messages themselves provide little contextual information and result in sparse features for effective classification. With these challenges in mind, we present an expert-driven framework for time-aware topical classification framework of short messages. Three of the salient features of the framework are (i) a novel expert-centric classifier, (ii) a sliding-window training for adaptive topical classification, and (iii) a suite of enrichment-based methods (lexical, link, collocation) for overcoming feature sparsity in short messages.
  • Keywords
    electronic messaging; pattern classification; social networking (online); Facebook status updates; SMS communication; Twitter messages; enrichment based methods; expert centric classifier; expert driven topical classification; feature sparsity; rapid concept drift; short message streams; sliding window training; time aware topical classification; time evolving streams; Entropy; Feature extraction; Frequency measurement; Medical services; Training; Twitter; classification; short-text; social media;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), 2011 IEEE Third International Conference on
  • Conference_Location
    Boston, MA
  • Print_ISBN
    978-1-4577-1931-8
  • Type

    conf

  • DOI
    10.1109/PASSAT/SocialCom.2011.213
  • Filename
    6113139