Title :
Dimensions of meaning
Author :
Schütze, Hinrich
Author_Institution :
Center for the Study of Language & Inf., Stanford, CA, USA
Abstract :
The representation of documents and queries as vectors in a high-dimensional space is well-established in information retrieval. The author proposes that the semantics of words and contexts in a text be represented as vectors. The dimensions of the space are words and the initial vectors are determined by the words occurring close to the entity to be represented, which implies that the space has several thousand dimensions (words). This makes the vector representations (which are dense) too cumbersome to use directly. Therefore, dimensionality reduction by means of a singular value decomposition is employed. The author analyzes the structure of the vector representations and applies them to word sense disambiguation and thesaurus induction
Keywords :
information retrieval; linguistics; information retrieval; semantics of words; thesaurus induction; vector representations; word sense disambiguation; Context modeling; Information retrieval; Multidimensional systems; Singular value decomposition; Thesauri;
Conference_Titel :
Supercomputing '92., Proceedings
Conference_Location :
Minneapolis, MN
Print_ISBN :
0-8186-2630-5
DOI :
10.1109/SUPERC.1992.236684