Title :
GA based information retrieval system using dimensionality reduction techniques
Author :
Thakare, Anuradha D. ; Dhote, C.A. ; Chaudhari, Anagha N.
Author_Institution :
Dept. of Comput. Eng., Pimpri Chinchwad Coll. of Eng., Pune, India
Abstract :
In this paper, we propose a genetic algorithm (GA) method for information retrieval (IR) based on Singular Value Decomposition (SVD) and Principal Component Analysis (PCA). The main difficulty in a GA-based IR system is the processing of high-dimensional input strings, as this affects performance in terms of retrieval time. In the proposed work, we reduce the high-dimensional input data to a lower dimension in order to improve retrieval time. SVD and PCA computations are performed on the input data streams to produce reduced-rank matrix approximations and orthogonal transformations. With the reduced matrix, we obtained promising results in terms of computation time when the GA is applied for IR. Experiments were performed on a sample dataset of 2500 input documents, from which weighted vectors were generated. Information retrieval was done using two techniques, PCA-GA and SVD-GA, with all classical matching functions. PCA-GA was found to perform better than SVD-GA in terms of computation time.
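The dimensionality reduction step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names svd_reduce and pca_reduce, the random weight matrix, and the target rank k = 3 are all assumptions chosen for illustration, and the GA search and matching functions are not shown.

```python
import numpy as np

def svd_reduce(X, k):
    """Truncated SVD: coordinates of each row of X along the top-k singular directions."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    # U[:, :k] * s[:k] is the same as X @ Vt[:k].T
    return U[:, :k] * s[:k]

def pca_reduce(X, k):
    """PCA: center the columns, then project onto the top-k principal axes."""
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:k].T

# Hypothetical input: 6 documents, each a vector of 10 term weights.
rng = np.random.default_rng(0)
weights = rng.random((6, 10))

docs_svd = svd_reduce(weights, 3)  # shape (6, 3)
docs_pca = pca_reduce(weights, 3)  # shape (6, 3)
```

In a pipeline like the one the abstract describes, the reduced vectors (rather than the full weighted vectors) would presumably be what the GA operates on, which is where the reported saving in computation time would come from.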
Keywords :
genetic algorithms; information retrieval; matrix algebra; principal component analysis; singular value decomposition; GA based IR system; GA based information retrieval system; PCA-GA; SVD-GA; data streams; dimensionality reduction techniques; genetic algorithm; high dimensional input data; high dimensional input strings; low dimensional input data; orthogonal transformations; reduced rank matrix approximation; retrieval time; Clustering; Dimensionality Reduction; Latent Semantic Indexing
Conference_Title :
Fifth International Conference on Advances in Recent Technologies in Communication and Computing (ARTCom 2013)
Conference_Location :
Bangalore
Print_ISBN :
978-1-84919-842-4
DOI :
10.1049/cp.2013.2207