Title :
Class Selection Based Iterative Supervised Latent Semantic Indexing for Text Categorization
Author :
Wang, Ming-Bo ; Liu, Cheng-Lin
Author_Institution :
Nat. Lab. of Pattern Recognition (NLPR), Chinese Acad. of Sci., Beijing, China
Abstract :
Latent semantic indexing (LSI) is an effective technique for feature extraction in text mining, and supervised LSI (SLSI) algorithms have been proposed to exploit the class labels of training data. In this paper, we propose an iterative SLSI framework based on class selection. We show that a previous iterative SLSI algorithm is an instance of the framework. We also propose a method under our framework, which selects a class at each iteration using a simple classifier and computes the main bias vector of one class only. Our experiments demonstrate that the proposed method both improves the classification accuracy and reduces the computation cost.
Keywords :
data mining; feature extraction; indexing; text analysis; class selection; feature extraction; iterative supervised latent semantic indexing; text categorization; text mining; Feature extraction; Indexing; Iterative algorithms; Laboratories; Large scale integration; Matrix decomposition; Principal component analysis; Text categorization; Text mining; Training data;
Conference_Titel :
Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4994-1
DOI :
10.1109/ICIECS.2009.5366100