Title :
Extending Attribute Information for Small Data Set Classification
Author :
Li, Der-Chiang ; Liu, Chiao-Wen
Author_Institution :
Dept. of Ind. & Inf. Manage., Nat. Cheng Kung Univ., Tainan, Taiwan
fDate :
3/1/2012 12:00:00 AM
Abstract :
Data quantity is the main issue in the small data set problem, because usually insufficient data will not lead to a robust classification performance. How to extract more effective information from a small data set is thus of considerable interest. This paper proposes a new attribute construction approach which converts the original data attributes into a higher dimensional feature space to extract more attribute information by a similarity-based algorithm using the classification-oriented fuzzy membership function. Seven data sets with different attribute sizes are employed to examine the performance of the proposed method. The results show that the proposed method has a superior classification performance when compared to principal component analysis (PCA), kernel principal component analysis (KPCA), and kernel independent component analysis (KICA) with a Gaussian kernel in the support vector machine (SVM) classifier.
Keywords :
Gaussian processes; data handling; fuzzy set theory; information retrieval; pattern classification; principal component analysis; support vector machines; Gaussian kernel; attribute construction approach; attribute information; attribute information extraction; classification-oriented fuzzy membership function; data attributes; data quantity; feature space; kernel independent component analysis; kernel principal component analysis; principal component analysis; similarity-based algorithm; small data set classification; support vector machine classifier; Accuracy; Artificial neural networks; Classification algorithms; Feature extraction; Kernel; Principal component analysis; Support vector machines; Classification; feature construction; small data set; support vector machine.;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2010.254