DocumentCode
2337669
Title
A fast approximate algorithm for large-scale Latent Semantic Indexing
Author
Zhang, Dell ; Zhu, Zheng
Author_Institution
SCSIS, Univ. of London, London
fYear
2008
fDate
13-16 Nov. 2008
Firstpage
626
Lastpage
631
Abstract
Latent semantic indexing (LSI) is an effective method to discover the underlying semantic structure of data. It has numerous applications in information retrieval and data mining. However, the computational complexity of LSI may be prohibitively high when applied to very large datasets. In this paper, we present a fast approximate algorithm for large-scale LSI that is conceptually simple and theoretically justified. Our main contribution is to show that the proposed algorithm has provable error bound and linear computational complexity.
Keywords
approximation theory; indexing; text analysis; data semantic structure; fast approximate algorithm; large-scale latent semantic indexing; Computational complexity; DNA; Data mining; Indexing; Information retrieval; Large scale integration; Large-scale systems; Linear algebra; Matrix decomposition; Object recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Information Management, 2008. ICDIM 2008. Third International Conference on
Conference_Location
London
Print_ISBN
978-1-4244-2916-5
Electronic_ISBN
978-1-4244-2917-2
Type
conf
DOI
10.1109/ICDIM.2008.4746764
Filename
4746764
Link To Document