DocumentCode :
468314
Title :
Learning Locality Discriminating Indexing for Text Categorization
Author :
Hu, Jiani ; Deng, Weihong ; Guo, Jun ; Xu, Weiran
Author_Institution :
Beijing Univ. of Posts & Telecommun., Beijing
Volume :
3
fYear :
2007
fDate :
24-27 Aug. 2007
Firstpage :
239
Lastpage :
242
Abstract :
This paper introduces a locality discriminating indexing (LDI) algorithm for text categorization. The LDI algorithm offers a manifold way of discriminant analysis. Based on the hypothesis that samples from different classes reside in class-specific manifold structures, the algorithm depicts the manifold structures by a nearest-native graph and a invader graphs. And a new locality discriminant criterion is proposed, which best preserves the within-class local structures while suppresses the between-class overlap. Using the notion of the Laplacian of the graphs, the LDI algorithm finds the optimal linear transformation by solving the generalized eigenvalue problem. The feasibility of the LDI algorithm has been successfully tested in text categorization using 20NG and Reuters-21578 databases. Experiment results show LDI is an effective technique for document modeling and representations for classification.
Keywords :
eigenvalues and eigenfunctions; graphs; indexing; text analysis; discriminant analysis; document modeling; generalized eigenvalue problem; invader graph; learning; locality discriminant criterion; locality discriminating indexing algorithm; nearest-native graph; optimal linear transformation; text categorization; Algorithm design and analysis; Clustering algorithms; Eigenvalues and eigenfunctions; Indexing; Laplace equations; Large scale integration; Linear discriminant analysis; Scattering; Testing; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
Type :
conf
DOI :
10.1109/FSKD.2007.383
Filename :
4406236
Link To Document :
بازگشت