• DocumentCode
    2833113
  • Title

    Improvement of Supervised Isomap Algorithm and Its Application to Visualization and Categorization of Web Chinese Text

  • Author

    Jia, Tu ; Yi, Wu

  • Author_Institution
    Dept. of Math. & Syst. Sci., Nat. Univ. of Defense Technol., Changsha
  • fYear
    2008
  • fDate
    Aug. 29 2008-Sept. 2 2008
  • Firstpage
    157
  • Lastpage
    161
  • Abstract
    It is important to reduce the dimensionality of features in Web Chinese text categorization. Isomap algorithm is an unsupervised manifold learning technique. SIIsomap algorithm, an extension of Isomap to supervised feature extraction, is proposed in this paper. It uses adding constant method and a direct embedding technique of Isomap algorithm for testing data to make the embedding more reasonable and easier. SIIsomap algorithm is applied to visualization and classification experiments of Web Chinese text as a feature extraction method. In contrast with existed methods, it gets better visualization and classification effects and illustrates the effectiveness of our method.
  • Keywords
    Internet; data mining; data visualisation; feature extraction; pattern classification; text analysis; unsupervised learning; Web Chinese text; data visualization; direct embedding technique; feature extraction method; pattern classification; supervised feature extraction; supervised isomap algorithm; unsupervised manifold learning technique; Application software; Computer science; Data visualization; Feature extraction; Information technology; Kernel; Principal component analysis; Testing; Text categorization; Training data; Feature Extraction; Isomap; Supervised Isomap; Web Chinese Text Categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology, 2008. ICCSIT '08. International Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-0-7695-3308-7
  • Type

    conf

  • DOI
    10.1109/ICCSIT.2008.56
  • Filename
    4624852