• DocumentCode
    168426
  • Title

    PageRank-based Word Sense Induction within Web Search Results Clustering

  • Author

    Moreno, Jose G. ; Dias, Guilherme

  • Author_Institution
    GREYC, Normandie Univ., Caen, France
  • fYear
    2014
  • fDate
    8-12 Sept. 2014
  • Firstpage
    465
  • Lastpage
    466
  • Abstract
    Word Sense Induction is an open problem in Natural Language Processing. Many recent works have been addressing this problem with a wide spectrum of strategies based on content analysis. In this paper, we present a sense induction strategy exclusively based on link analysis over the Web. In particular, we explore the idea that the main different senses of a given word share similar linking properties and can be found by performing clustering with link-based similarity metrics. The evaluation results show that PageRank-based sense induction achieves interesting results when compared to state-of-the-art content-based algorithms in the context of Web Search Results Clustering.
  • Keywords
    Internet; content management; natural language processing; pattern clustering; search engines; PageRank-based word sense induction; Web search results clustering; content analysis; link analysis; link-based similarity metrics; natural language processing; Algorithm design and analysis; Clustering algorithms; Joining processes; Kernel; Measurement; Web pages; Web search; PageRank Clustering; Web Links; Word Sense Induction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on
  • Conference_Location
    London
  • Type

    conf

  • DOI
    10.1109/JCDL.2014.6970227
  • Filename
    6970227