• DocumentCode
    1611460
  • Title

    Principal component analysis neural network for textual document categorization and dimension reduction

  • Author

    Jaffali, Soufiene ; Jamoussi, Salma

  • Author_Institution
    Syst. & Adv. Comput. Lab., Univ. of Sfax, Sfax, Tunisia
  • fYear
    2012
  • Firstpage
    835
  • Lastpage
    839
  • Abstract
    This manuscript presents the study and application of the method of principal component analysis (PCA) in the field of text mining. We began by studying the theoretical basis behind this method and we have focused on two of its variants namely the neural PCA and kernel PCA. We used neural PCA for automatic categorization of text documents through an extraction of semantic concepts. The second contribution of our work is the use of PCA (neuronal and kernel) for the dimension reduction of textual documents through the automatic classification.
  • Keywords
    data mining; data reduction; neural nets; principal component analysis; text analysis; automatic classification; automatic text documents categorization; dimension reduction; kernel PCA; neural PCA; principal component analysis neural network; semantic concepts extraction; text mining; textual document categorization; Covariance matrix; Eigenvalues and eigenfunctions; Electronic mail; Kernel; Neurons; Principal component analysis; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Sciences of Electronics, Technologies of Information and Telecommunications (SETIT), 2012 6th International Conference on
  • Conference_Location
    Sousse
  • Print_ISBN
    978-1-4673-1657-6
  • Type

    conf

  • DOI
    10.1109/SETIT.2012.6482024
  • Filename
    6482024