DocumentCode
1273420
Title
Building hypertext links by computing semantic similarity
Author
Green, Stephen J.
Author_Institution
Div. of Inf. & Commun. Sci., Macquarie Univ., North Ryde, NSW, Australia
Volume
11
Issue
5
fYear
1999
Firstpage
713
Lastpage
730
Abstract
Most current automatic hypertext generation systems rely on term repetition to calculate the relatedness of two documents. There are well-recognized problems with such approaches, most notably a vulnerability to the effects of synonymy (many words for the same concept) and polysemy (many concepts for the same word). We propose a novel method for automatic hypertext generation that is based on a technique called lexical chaining, a method for discovering sequences of related words in a text. This method uses a more general notion of document relatedness, and attempts to take into account the effects of synonymy and polysemy. We also present the results of an empirical study designed to test this method in the context of a question answering task from a database of newspaper articles.
Keywords
computational linguistics; hypermedia; information retrieval; text analysis; automatic hypertext generation; automatic hypertext generation systems; document relatedness; hypertext links; information retrieval; lexical chaining; newspaper article database; polysemy; question answering task; semantic similarity; synonymy; term repetition; Automatic testing; Buildings; Computational linguistics; Costs; Databases; Humans; Information retrieval; Large-scale systems; System testing; Writing
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
IEEE
ISSN
1041-4347
Type
jour
DOI
10.1109/69.806932
Filename
806932