Title :
Using External Information for Classifying Tweets
Author :
Weissbock, Josh ; Esmin, Ahmed A. A. ; Inkpen, Diana
Author_Institution :
Sch. of Electr. Eng. & Comput. Sci., Univ. of Ottawa, Ottawa, ON, Canada
Abstract :
Automatic classification of texts by topic is a well-studied problem. Nonetheless, classifying twitter messages by topic is difficult because the messages are short and the features space for classification is very sparse. We propose a method to enhance the text of the messages that contain links with external information such as the title of the web pages, and with the most frequent terms from these web pages. We show that the results of the classification improve substantially when adding this external information.
Keywords :
pattern classification; social networking (online); text analysis; Twitter message classification; Web pages; automatic text classification; classification feature space; external information; tweets classification; Accuracy; Data models; Educational institutions; Support vector machines; Training data; Twitter; Web pages; Automatic Text Classification; Machine Learning; Twitter Messages;
Conference_Titel :
Intelligent Systems (BRACIS), 2013 Brazilian Conference on
Conference_Location :
Fortaleza
DOI :
10.1109/BRACIS.2013.9