DocumentCode
3311912
Title
Dimensions of meaning
Author
Schütze, Hinrich
Author_Institution
Center for the Study of Language and Information, Stanford, CA, USA
fYear
1992
fDate
16-20 Nov 1992
Firstpage
787
Lastpage
796
Abstract
The representation of documents and queries as vectors in a high-dimensional space is well established in information retrieval. The author proposes that the semantics of words and contexts in a text be represented as vectors. The dimensions of the space are words, and the initial vectors are determined by the words occurring close to the entity to be represented, which implies that the space has several thousand dimensions (words). This makes the vector representations (which are dense) too cumbersome to use directly. Therefore, dimensionality reduction by means of a singular value decomposition is employed. The author analyzes the structure of the vector representations and applies them to word sense disambiguation and thesaurus induction.
Keywords
information retrieval; linguistics; semantics of words; thesaurus induction; vector representations; word sense disambiguation; Context modeling; Multidimensional systems; Singular value decomposition; Thesauri
fLanguage
English
Publisher
IEEE
Conference_Titel
Supercomputing '92, Proceedings
Conference_Location
Minneapolis, MN
Print_ISBN
0-8186-2630-5
Type
conf
DOI
10.1109/SUPERC.1992.236684
Filename
236684