DocumentCode
3425819
Title
Learning from Twitter Hashtags: Leveraging Proximate Tags to Enhance Graph-Based Keyphrase Extraction
Author
Bellaachia, Abdelghani ; Al-Dhelaan, M.
Author_Institution
Comput. Sci. Dept., George Washington Univ., Washington, DC, USA
fYear
2012
fDate
20-23 Nov. 2012
Firstpage
348
Lastpage
357
Abstract
In the micro-blogging service Twitter, the sparseness of text messages is an enormous obstacle in extracting key phrases from tweets. However, regardless of the sparseness in text, tweets include an abundant number of links in the form of hash tags. This paper investigates the possibility of leveraging hash tags in tweets to enhance the graph-based key phrase extraction. By using an auxiliary set of tweets found in hash tags, we show that we can improve extracting key phrases from tweets by augmenting the graph with a wider knowledge context. Specifically, we propose two different approaches for choosing the best hash tags links to use for enhancing graph-based key phrase extraction by either using a frequency approach or a hybrid approach that uses multiple methods for cleverly choosing the best hash tags. Experiments on the proposed approaches showed an improvement in the range of 9% to 37% over the case of ignoring the hash tag links.
Keywords
cryptography; graph theory; social networking (online); text analysis; Twitter hashtags; graph augmentation; graph-based keyphrase extraction; hashtag links; knowledge context; microblogging service; text messages; Blogs; Conferences; Context; Feature extraction; Frequency measurement; Twitter; Graph-based Ranking; Hashtag; Keyphrase Extraction; NE-Rank; Social Network Analysis; Twitter;
fLanguage
English
Publisher
ieee
Conference_Titel
Green Computing and Communications (GreenCom), 2012 IEEE International Conference on
Conference_Location
Besancon
Print_ISBN
978-1-4673-5146-1
Type
conf
DOI
10.1109/GreenCom.2012.58
Filename
6468336
Link To Document