Title of article
Pattern recognition based on canonical correlations in a high dimension low sample size context
Author/Authors
Tamatani، نويسنده , , Mitsuru and Koch، نويسنده , , Inge and Naito، نويسنده , , Kanta، نويسنده ,
Issue Information
دوفصلنامه با شماره پیاپی سال 2012
Pages
18
From page
350
To page
367
Abstract
This paper is concerned with pattern recognition for 2-class problems in a High Dimension Low Sample Size (hdlss) setting. The proposed method is based on canonical correlations between the predictors X and responses Y . The paper proposes a modified version of the canonical correlation matrix Σ X − 1 / 2 Σ X Y Σ Y − 1 / 2 which is suitable for discrimination with class labels Y in a hdlss context. The modified canonical correlation matrix yields ranking vectors for variable selection, a discriminant direction and a rule which is essentially equivalent to the naive Bayes rule. The paper examines the asymptotic behavior of the ranking vectors and the discriminant direction and gives precise conditions for hdlss consistency in terms of the growth rates of the dimension and sample size. The feature selection induced by the discriminant direction as ranking vector is shown to work efficiently in simulations and in applications to real hdlss data.
Keywords
canonical correlations , Consistency , High Dimension Low Sample Size , Misclassification , Naive Bayes rule
Journal title
Journal of Multivariate Analysis
Serial Year
2012
Journal title
Journal of Multivariate Analysis
Record number
1565935
Link To Document