• DocumentCode
    2854285
  • Title

    Fast update of latent semantic spaces using a linear transform framework

  • Author

    Bellegarda, Jerome R.

  • Author_Institution
    Spoken Language Group, Apple Computer, Inc., Cupertino, California 95014, USA
  • Volume
    1
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    Latent semantic analysis suffers from a relatively high sensitivity to both task domain and composition style. Because the traditional “folding-in” process simply populates the existing semantic vector space with current data, performance degrades when training and operating conditions differ. On the other hand, recomputing the semantic space from scratch typically precludes real-time operation. An adaptation strategy therefore makes sense as a potential compromise. This paper exploits a linear transform framework to update the semantic space as new data becomes available. Leveraging suitable Cholesky factorizations makes the update computationally efficient. Experiments with different increment sizes are conducted, and the paper discusses the comparative merits of this approach under several scenarios.
  • Keywords
    Argon; Benchmark testing; Data mining; Matrix decomposition; Semantics; Strontium; USA Councils;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743831
  • Filename
    5743831