DocumentCode
2833113
Title
Improvement of Supervised Isomap Algorithm and Its Application to Visualization and Categorization of Web Chinese Text
Author
Jia, Tu ; Yi, Wu
Author_Institution
Dept. of Math. & Syst. Sci., Nat. Univ. of Defense Technol., Changsha
fYear
2008
fDate
Aug. 29 2008-Sept. 2 2008
Firstpage
157
Lastpage
161
Abstract
It is important to reduce the dimensionality of features in Web Chinese text categorization. Isomap algorithm is an unsupervised manifold learning technique. SIIsomap algorithm, an extension of Isomap to supervised feature extraction, is proposed in this paper. It uses adding constant method and a direct embedding technique of Isomap algorithm for testing data to make the embedding more reasonable and easier. SIIsomap algorithm is applied to visualization and classification experiments of Web Chinese text as a feature extraction method. In contrast with existed methods, it gets better visualization and classification effects and illustrates the effectiveness of our method.
Keywords
Internet; data mining; data visualisation; feature extraction; pattern classification; text analysis; unsupervised learning; Web Chinese text; data visualization; direct embedding technique; feature extraction method; pattern classification; supervised feature extraction; supervised isomap algorithm; unsupervised manifold learning technique; Application software; Computer science; Data visualization; Feature extraction; Information technology; Kernel; Principal component analysis; Testing; Text categorization; Training data; Feature Extraction; Isomap; Supervised Isomap; Web Chinese Text Categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Technology, 2008. ICCSIT '08. International Conference on
Conference_Location
Singapore
Print_ISBN
978-0-7695-3308-7
Type
conf
DOI
10.1109/ICCSIT.2008.56
Filename
4624852
Link To Document