DocumentCode
424467
Title
Computational Methods for Intelligent Information Access
Author
Berry, Michael W. ; Dumais, Susan T. ; Letsche, Todd A.
Author_Institution
University of Tennessee
fYear
1995
fDate
1995
Firstpage
20
Lastpage
20
Abstract
Currently, most approaches to retrieving textual materials from scientific databases depend on a lexical match between words in users’ requests and those in or assigned to documents in a database. Because of the tremendous diversity in the words people use to describe the same document, lexical methods are necessarily incomplete and imprecise. Using the singular value decomposition (SVD), one can take advantage of the implicit higher-order structure in the association of terms with documents by determining the SVD of large sparse term by document matrices. Terms and documents represented by 200-300 of the largest singular vectors are then matched against user queries. We call this retrieval method Latent Semantic Indexing (LSI) because the subspace represents important associative relationships between terms and documents that are not evident in individual documents. LSI is a completely automatic yet intelligent indexing method, widely applicable, and a promising way to improve users’ access to many kinds of textual materials, or to documents and services for which textual descriptions are available. A survey of the computational requirements for managing LSI-encoded databases as well as current and future applications of LSI is presented.
Keywords
indexing; information; latent; matrices; retrieval; semantic; singular value decomposition; sparse; updating; Computational intelligence; Computer science; Databases; Indexing; Information retrieval; Information science; Large scale integration; Matrix decomposition; Singular value decomposition; Sparse matrices; indexing; information; latent; matrices; retrieval; semantic; singular value decomposition; sparse; updating;
fLanguage
English
Publisher
ieee
Conference_Titel
Supercomputing, 1995. Proceedings of the IEEE/ACM SC95 Conference
Print_ISBN
0-89791-816-9
Type
conf
DOI
10.1109/SUPERC.1995.242666
Filename
1383154
Link To Document