• DocumentCode
    3311912
  • Title
    Dimensions of meaning
  • Author
    Schütze, Hinrich
  • Author_Institution
    Center for the Study of Language and Information, Stanford, CA, USA
  • fYear
    1992
  • fDate
    16-20 Nov 1992
  • Firstpage
    787
  • Lastpage
    796
  • Abstract
    The representation of documents and queries as vectors in a high-dimensional space is well established in information retrieval. The author proposes that the semantics of words and contexts in a text be represented as vectors as well. The dimensions of the space are words, and the initial vector for an entity is determined by the words occurring close to it, so the space has several thousand dimensions (words). Because these vectors are dense, they are too cumbersome to use directly. Their dimensionality is therefore reduced by means of a singular value decomposition. The author analyzes the structure of the resulting vector representations and applies them to word sense disambiguation and thesaurus induction.
  • Keywords
    information retrieval; linguistics; semantics of words; thesaurus induction; vector representations; word sense disambiguation; Context modeling; Information retrieval; Multidimensional systems; Singular value decomposition; Thesauri
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Supercomputing '92, Proceedings
  • Conference_Location
    Minneapolis, MN
  • Print_ISBN
    0-8186-2630-5
  • Type
    conf
  • DOI
    10.1109/SUPERC.1992.236684
  • Filename
    236684
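
The pipeline summarized in the abstract (co-occurrence vectors over a word vocabulary, followed by dimensionality reduction with a singular value decomposition) can be illustrated with a minimal sketch. This is not the paper's implementation: the toy sentence, the window size of 2, the target dimensionality `k=3`, and the helper names `cooccurrence_matrix`, `reduce_dimensions`, and `cosine` are all assumptions made for illustration, and NumPy is assumed to be available.

```python
import numpy as np


def cooccurrence_matrix(tokens, window=2):
    """Count, for each word, the words that occur within `window` positions of it.

    Hypothetical helper: rows and columns are both indexed by the sorted vocabulary.
    """
    vocab = sorted(set(tokens))
    index = {word: i for i, word in enumerate(vocab)}
    counts = np.zeros((len(vocab), len(vocab)))
    for pos, word in enumerate(tokens):
        lo = max(0, pos - window)
        hi = min(len(tokens), pos + window + 1)
        for other in range(lo, hi):
            if other != pos:
                counts[index[word], index[tokens[other]]] += 1
    return vocab, counts


def reduce_dimensions(counts, k):
    """Represent each row of `counts` by its coordinates along the top-k singular directions."""
    u, s, _ = np.linalg.svd(counts, full_matrices=False)
    return u[:, :k] * s[:k]


def cosine(a, b):
    """Cosine similarity between two vectors; 0.0 if either vector is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom else 0.0


# Toy corpus (an assumption for illustration, not data from the paper).
text = (
    "the bank approved the loan the bank raised the interest rate "
    "the river bank flooded the river valley"
).split()

vocab, counts = cooccurrence_matrix(text, window=2)
vectors = reduce_dimensions(counts, k=3)

# Compare two words in the reduced space.
loan, rate = vocab.index("loan"), vocab.index("rate")
similarity = cosine(vectors[loan], vectors[rate])
```

On a realistic corpus the co-occurrence matrix would have several thousand columns, which is the case where the reduction to a small `k` matters; a sparse or truncated SVD routine would likely replace the dense `np.linalg.svd` call used here.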