• DocumentCode
    3425819
  • Title

    Learning from Twitter Hashtags: Leveraging Proximate Tags to Enhance Graph-Based Keyphrase Extraction

  • Author

    Bellaachia, Abdelghani ; Al-Dhelaan, M.

  • Author_Institution
    Comput. Sci. Dept., George Washington Univ., Washington, DC, USA
  • fYear
    2012
  • fDate
    20-23 Nov. 2012
  • Firstpage
    348
  • Lastpage
    357
  • Abstract
    In the micro-blogging service Twitter, the sparseness of text messages is an enormous obstacle in extracting key phrases from tweets. However, regardless of the sparseness in text, tweets include an abundant number of links in the form of hash tags. This paper investigates the possibility of leveraging hash tags in tweets to enhance the graph-based key phrase extraction. By using an auxiliary set of tweets found in hash tags, we show that we can improve extracting key phrases from tweets by augmenting the graph with a wider knowledge context. Specifically, we propose two different approaches for choosing the best hash tags links to use for enhancing graph-based key phrase extraction by either using a frequency approach or a hybrid approach that uses multiple methods for cleverly choosing the best hash tags. Experiments on the proposed approaches showed an improvement in the range of 9% to 37% over the case of ignoring the hash tag links.
  • Keywords
    cryptography; graph theory; social networking (online); text analysis; Twitter hashtags; graph augmentation; graph-based keyphrase extraction; hashtag links; knowledge context; microblogging service; text messages; Blogs; Conferences; Context; Feature extraction; Frequency measurement; Twitter; Graph-based Ranking; Hashtag; Keyphrase Extraction; NE-Rank; Social Network Analysis; Twitter;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Green Computing and Communications (GreenCom), 2012 IEEE International Conference on
  • Conference_Location
    Besancon
  • Print_ISBN
    978-1-4673-5146-1
  • Type

    conf

  • DOI
    10.1109/GreenCom.2012.58
  • Filename
    6468336