• DocumentCode
    2337669
  • Title

    A fast approximate algorithm for large-scale Latent Semantic Indexing

  • Author

    Zhang, Dell ; Zhu, Zheng

  • Author_Institution
    SCSIS, Univ. of London, London
  • fYear
    2008
  • fDate
    13-16 Nov. 2008
  • Firstpage
    626
  • Lastpage
    631
  • Abstract
    Latent semantic indexing (LSI) is an effective method to discover the underlying semantic structure of data. It has numerous applications in information retrieval and data mining. However, the computational complexity of LSI may be prohibitively high when applied to very large datasets. In this paper, we present a fast approximate algorithm for large-scale LSI that is conceptually simple and theoretically justified. Our main contribution is to show that the proposed algorithm has provable error bound and linear computational complexity.
  • Keywords
    approximation theory; indexing; text analysis; data semantic structure; fast approximate algorithm; large-scale latent semantic indexing; Computational complexity; DNA; Data mining; Indexing; Information retrieval; Large scale integration; Large-scale systems; Linear algebra; Matrix decomposition; Object recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Information Management, 2008. ICDIM 2008. Third International Conference on
  • Conference_Location
    London
  • Print_ISBN
    978-1-4244-2916-5
  • Electronic_ISBN
    978-1-4244-2917-2
  • Type

    conf

  • DOI
    10.1109/ICDIM.2008.4746764
  • Filename
    4746764