• DocumentCode
    1787475
  • Title

    Feature Selection for Twitter Classification

  • Author

    Ostrowski, David Alfred

  • fYear
    2014
  • fDate
    16-18 June 2014
  • Firstpage
    267
  • Lastpage
    272
  • Abstract
    Twitter-based messages have presented challenges in the identification of features as applied to classification. This paper explores filtering techniques for improved trend detection and information extraction. Starting with a pre-filtered source (Twitter), we will examine the application of both information theory and Natural Language Processing (NLP) based techniques as a means of preprocessing for classification. Results demonstrate that both means allow for improved results in classification among highly idiosyncratic data (Twitter).
  • Keywords
    feature selection; information filtering; information theory; natural language processing; pattern classification; social networking (online); NLP based techniques; Twitter classification; Twitter-based messages; feature identification; feature selection; filtering techniques; improved trend detection; information extraction; information theory; natural language processing; pre-filtered source; Classification algorithms; Filtering theory; Market research; Measurement; Mutual information; Twitter; Classification; Machine learning; Natural Language Processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing (ICSC), 2014 IEEE International Conference on
  • Conference_Location
    Newport Beach, CA
  • Print_ISBN
    978-1-4799-4002-8
  • Type

    conf

  • DOI
    10.1109/ICSC.2014.50
  • Filename
    6882039