Title :
Feature Selection for Twitter Classification
Author :
Ostrowski, David Alfred
Abstract :
Twitter-based messages have presented challenges in the identification of features as applied to classification. This paper explores filtering techniques for improved trend detection and information extraction. Starting with a pre-filtered source (Twitter), we will examine the application of both information theory and Natural Language Processing (NLP) based techniques as a means of preprocessing for classification. Results demonstrate that both means allow for improved results in classification among highly idiosyncratic data (Twitter).
Keywords :
feature selection; information filtering; information theory; natural language processing; pattern classification; social networking (online); NLP based techniques; Twitter classification; Twitter-based messages; feature identification; feature selection; filtering techniques; improved trend detection; information extraction; information theory; natural language processing; pre-filtered source; Classification algorithms; Filtering theory; Market research; Measurement; Mutual information; Twitter; Classification; Machine learning; Natural Language Processing;
Conference_Titel :
Semantic Computing (ICSC), 2014 IEEE International Conference on
Conference_Location :
Newport Beach, CA
Print_ISBN :
978-1-4799-4002-8
DOI :
10.1109/ICSC.2014.50