Title :
Structural classification based correlation and its application to principal component analysis for high-dimension low-sample size data
Author_Institution :
Fac. of Eng., Inf. & Syst., Univ. of Tsukuba, Tsukuba, Japan
Abstract :
This paper proposes a structural classification based correlation and applies it to principal component analysis (PCA) for high-dimension low-sample size (HDLSS) data. The structural classification based correlation combines two kinds of correlation: the correlation of objects over variables and the correlation of the classification structures of objects over clusters. It can therefore measure not only the similarity of objects but also the similarity of their classification structures. We apply this correlation to PCA for HDLSS data, in which the number of variables is much larger than the number of objects. Ordinary PCA is based on the eigenvalues of the covariance matrix of the variables, and it is known that these eigenvalues cannot be obtained correctly for HDLSS data, so ordinary PCA does not give a correct result for such data. To solve this problem, we use the proposed structural classification based correlation with respect to the variables. Because this correlation includes the correlation of classification structures, we can obtain a similarity relationship of objects in a lower-dimensional space spanned by the obtained principal components. Several numerical examples show the effectiveness of the proposed PCA using the structural classification based correlation for HDLSS data.
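The HDLSS difficulty described in the abstract can be illustrated with a minimal sketch. This is not the proposed method: it only shows, on synthetic Gaussian data, that the sample covariance matrix of the variables has at most n - 1 nonzero eigenvalues when the number of objects n is much smaller than the number of variables p. The function name `covariance_spectrum`, the sizes, and the tolerance are assumptions chosen for illustration.

```python
import numpy as np

def covariance_spectrum(n_objects=10, n_variables=200, seed=0):
    """Eigenvalues (descending) of the sample covariance matrix of the variables."""
    rng = np.random.default_rng(seed)
    # Synthetic HDLSS data: rows are objects, columns are variables (assumed sizes).
    X = rng.standard_normal((n_objects, n_variables))
    # Center each variable over the objects.
    Xc = X - X.mean(axis=0)
    # p x p sample covariance matrix of the variables.
    cov = Xc.T @ Xc / (n_objects - 1)
    # eigvalsh returns ascending eigenvalues of a symmetric matrix; reverse them.
    return np.linalg.eigvalsh(cov)[::-1]

eigvals = covariance_spectrum()
# Count eigenvalues above a small numerical tolerance (assumed threshold).
nonzero = int(np.sum(eigvals > 1e-8))
print(f"{len(eigvals)} eigenvalues, {nonzero} numerically nonzero")
```

Because the centered data matrix has rank at most n - 1, the count of nonzero eigenvalues cannot exceed 9 here, however many variables are added. Most of the 200-dimensional covariance spectrum is therefore uninformative, which is the situation the abstract says ordinary PCA cannot handle.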
Keywords :
correlation methods; covariance matrices; data handling; eigenvalues and eigenfunctions; fuzzy set theory; pattern classification; pattern clustering; principal component analysis; HDLSS data; PCA; classification structure similarity; covariance matrix eigen-values; fuzzy clustering model; high-dimension low-sample size data; structural classification based correlation; Clustering methods; Correlation; Covariance matrix; Data models; Equations; Mathematical model; classification structure; correlation based analysis; fuzzy cluster
Conference_Title :
Fuzzy Systems (FUZZ-IEEE), 2012 IEEE International Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1507-4
Electronic_ISBN :
1098-7584
DOI :
10.1109/FUZZ-IEEE.2012.6251200