DocumentCode :
2823589
Title :
A Hybrid Web-Based Measure for Computing Semantic Relatedness Between Words
Author :
Spanakis, Gerasimos ; Siolas, Georgios ; Stafylopatis, Andreas
Author_Institution :
Intell. Syst. Lab., Nat. Tech. Univ. of Athens, Athens, Greece
fYear :
2009
fDate :
2-4 Nov. 2009
Firstpage :
441
Lastpage :
448
Abstract :
In this paper, we build a hybrid Web-based metric for computing semantic relatedness between words. The method exploits page counts, titles, snippets and URLs returned by a Web search engine. Our technique uses traditional information retrieval methods and is enhanced by page-count-based similarity scores which are integrated with automatically extracted lexico-synantic patterns from titles, snippets and URLs for all kinds of semantically related words provided by WordNet (synonyms, hypernyms, meronyms, antonyms). A support vector machine is used to solve the arising regression problem of word relatedness and the proposed method is evaluated on standard benchmark datasets. The method achieves an overall correlation of 0.88, which is the highest among other metrics up to date.
Keywords :
information retrieval; regression analysis; search engines; semantic Web; support vector machines; Web search engine; WordNet; hybrid Web-based measure; information retrieval methods; lexico-synantic pattern extraction; page-count-based similarity scores; regression analysis; support vector machine; Artificial intelligence; Content based retrieval; Data mining; Databases; Hybrid intelligent systems; Information retrieval; Laboratories; Search engines; Uniform resource locators; Web search; Web mining; semantic relatedness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence, 2009. ICTAI '09. 21st International Conference on
Conference_Location :
Newark, NJ
ISSN :
1082-3409
Print_ISBN :
978-1-4244-5619-2
Electronic_ISBN :
1082-3409
Type :
conf
DOI :
10.1109/ICTAI.2009.64
Filename :
5363726
Link To Document :
بازگشت