DocumentCode
1611460
Title
Principal component analysis neural network for textual document categorization and dimension reduction
Author
Jaffali, Soufiene ; Jamoussi, Salma
Author_Institution
Syst. & Adv. Comput. Lab., Univ. of Sfax, Sfax, Tunisia
fYear
2012
Firstpage
835
Lastpage
839
Abstract
This manuscript presents the study and application of the method of principal component analysis (PCA) in the field of text mining. We began by studying the theoretical basis behind this method and we have focused on two of its variants namely the neural PCA and kernel PCA. We used neural PCA for automatic categorization of text documents through an extraction of semantic concepts. The second contribution of our work is the use of PCA (neuronal and kernel) for the dimension reduction of textual documents through the automatic classification.
Keywords
data mining; data reduction; neural nets; principal component analysis; text analysis; automatic classification; automatic text documents categorization; dimension reduction; kernel PCA; neural PCA; principal component analysis neural network; semantic concepts extraction; text mining; textual document categorization; Covariance matrix; Eigenvalues and eigenfunctions; Electronic mail; Kernel; Neurons; Principal component analysis; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Sciences of Electronics, Technologies of Information and Telecommunications (SETIT), 2012 6th International Conference on
Conference_Location
Sousse
Print_ISBN
978-1-4673-1657-6
Type
conf
DOI
10.1109/SETIT.2012.6482024
Filename
6482024
Link To Document