DocumentCode :
814041
Title :
Structural semantic interconnections: a knowledge-based approach to word sense disambiguation
Author :
Navigli, Roberto ; Velardi, Paola
Author_Institution :
Dipartimento di Informatica, Universita of Roma "La Sapienza", Rome, Italy
Volume :
27
Issue :
7
fYear :
2005
fDate :
7/1/2005 12:00:00 AM
Firstpage :
1075
Lastpage :
1086
Abstract :
Word sense disambiguation (WSD) is traditionally considered an AI-hard problem. A break-through in this field would have a significant impact on many relevant Web-based applications, such as Web information retrieval, improved access to Web services, information extraction, etc. Early approaches to WSD, based on knowledge representation techniques, have been replaced in the past few years by more robust machine learning and statistical techniques. The results of recent comparative evaluations of WSD systems, however, show that these methods have inherent limitations. On the other hand, the increasing availability of large-scale, rich lexical knowledge resources seems to provide new challenges to knowledge-based approaches. In this paper, we present a method, called structural semantic interconnections (SSI), which creates structural specifications of the possible senses for each word in a context and selects the best hypothesis according to a grammar G, describing relations between sense specifications. Sense specifications are created from several available lexical resources that we integrated in part manually, in part with the help of automatic procedures. The SSI algorithm has been applied to different semantic disambiguation problems, like automatic ontology population, disambiguation of sentences in generic texts, disambiguation of words in glossary definitions. Evaluation experiments have been performed on specific knowledge domains (e.g., tourism, computer networks, enterprise interoperability), as well as on standard disambiguation test sets.
Keywords :
computational linguistics; grammars; ontologies (artificial intelligence); word processing; AI-hard problem; Web information retrieval; Web services; Web-based applications; automatic ontology population; generic texts; glossary definitions; grammar G; information extraction; knowledge representation techniques; knowledge-based approach; large-scale rich lexical knowledge resources; robust machine learning; semantic disambiguation problems; statistical techniques; structural semantic interconnections; word sense disambiguation; Availability; Data mining; Information retrieval; Knowledge representation; Large-scale systems; Machine learning; Ontologies; Robustness; Terminology; Web services; Index Terms- Natural language processing; ontology learning; structural pattern matching; word sense disambiguation.; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Dictionaries as Topic; Information Storage and Retrieval; Models, Statistical; Natural Language Processing; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Sequence Alignment; Sequence Analysis; Vocabulary, Controlled;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2005.149
Filename :
1432741
Link To Document :
بازگشت