DocumentCode :
3138125
Title :
A New Model of Information Content for Semantic Similarity in WordNet
Author :
Zhou, Zili ; Wang, Yanna ; Gu, Junzhong
Author_Institution :
Coll. of Phys. & Eng., Qufu Normal Univ., Qufu
Volume :
3
fYear :
2008
fDate :
13-15 Dec. 2008
Firstpage :
85
Lastpage :
89
Abstract :
Information Content (IC) is an important dimension of assessing the semantic similarity between two terms or word senses in word knowledge. The conventional method of obtaining IC of word senses is to combine knowledge of their hierarchical structure from an ontology like WordNet with actual usage in text as derived from a large corpus. In this paper, a new model of IC is presented, which relies on hierarchical structure alone. The model considers not only the hyponyms of each word sense but also its depth in the structure. The IC value is easier to calculate based on our model, and when used as the basis of a similarity approach it yields judgments that correlate more closely with human assessments than others, which using IC value obtained only considering the hyponyms and IC value got by employing corpus analysis.
Keywords :
information retrieval; ontologies (artificial intelligence); WordNet; corpus analysis; human assessments; information content; semantic similarity; word knowledge; Artificial intelligence; Computer science; Conferences; Educational institutions; History; Humans; Integrated circuit modeling; Knowledge engineering; Ontologies; Physics; Information Content; Semantic Similarity; WordNet;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Future Generation Communication and Networking Symposia, 2008. FGCNS '08. Second International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-3430-5
Electronic_ISBN :
978-0-7695-3546-3
Type :
conf
DOI :
10.1109/FGCNS.2008.16
Filename :
4813554
Link To Document :
بازگشت