DocumentCode :
168426
Title :
PageRank-based Word Sense Induction within Web Search Results Clustering
Author :
Moreno, Jose G. ; Dias, Guilherme
Author_Institution :
GREYC, Normandie Univ., Caen, France
fYear :
2014
fDate :
8-12 Sept. 2014
Firstpage :
465
Lastpage :
466
Abstract :
Word Sense Induction is an open problem in Natural Language Processing. Many recent works have been addressing this problem with a wide spectrum of strategies based on content analysis. In this paper, we present a sense induction strategy exclusively based on link analysis over the Web. In particular, we explore the idea that the main different senses of a given word share similar linking properties and can be found by performing clustering with link-based similarity metrics. The evaluation results show that PageRank-based sense induction achieves interesting results when compared to state-of-the-art content-based algorithms in the context of Web Search Results Clustering.
Keywords :
Internet; content management; natural language processing; pattern clustering; search engines; PageRank-based word sense induction; Web search results clustering; content analysis; link analysis; link-based similarity metrics; natural language processing; Algorithm design and analysis; Clustering algorithms; Joining processes; Kernel; Measurement; Web pages; Web search; PageRank Clustering; Web Links; Word Sense Induction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on
Conference_Location :
London
Type :
conf
DOI :
10.1109/JCDL.2014.6970227
Filename :
6970227
Link To Document :
بازگشت