• DocumentCode
    3002366
  • Title

    The limitation of the SVD for latent semantic indexing

  • Author

    Mirzal, Andri

  • Author_Institution
    Fac. of Comput., Univ. Teknol. Malaysia, Skudai, Malaysia
  • fYear
    2013
  • fDate
    Nov. 29 2013-Dec. 1 2013
  • Firstpage
    413
  • Lastpage
    416
  • Abstract
    Latent semantic indexing (LSI) is an indexing method for improving retrieval performance of an information retrieval system by grouping related documents to the same clusters so that each of these documents indexes the same (or almost the same) words, and unrelated documents index (relatively) different words. The de facto standard method for LSI is the truncated singular value decomposition (SVD). In this paper, we show that the LSI capability of the truncated SVD is not as conclusive as previously reported; rather it is a conditional aspect and when the condition is not met, then the truncated SVD can fail in recognizing the related documents resulting in a poor retrieval performance.
  • Keywords
    document handling; indexing; information retrieval; information retrieval systems; singular value decomposition; LSI capability; de facto standard method; information retrieval system; latent semantic indexing; related documents; retrieval performance; truncated SVD; truncated singular value decomposition; Approximation methods; Indexing; Large scale integration; Rivers; Semantics; Vectors; document analysis; indexing; information retrieval; latent semantic; singular value decomposition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control System, Computing and Engineering (ICCSCE), 2013 IEEE International Conference on
  • Conference_Location
    Mindeb
  • Print_ISBN
    978-1-4799-1506-4
  • Type

    conf

  • DOI
    10.1109/ICCSCE.2013.6720000
  • Filename
    6720000