DocumentCode
478209
Title
The Data Selection Criteria for HSC and SVM Algorithms
Author
He, Qing ; Zhuang, Fuzhen ; Shi, Zhongzhi
Author_Institution
Key Lab. of Intell. Inf. Process., Chinese Acad. of Sci., Beijing
Volume
3
fYear
2008
fDate
18-20 Oct. 2008
Firstpage
384
Lastpage
388
Abstract
This paper makes a discussion of consistent subsets (CS) selection criteria for hyper surface Classification (HSC) and SVM algorithms. The consistent subsets play an important role in the data selection. Firstly, the paper proposes that minimal consistent subset for a disjoint cover set (MCSC) plays an important role in the data selection for HSC. The MCSC can be applied to select a representative subset from the original sample set for HSC. MCSC has the same classification model with the entire sample set and can totally reflect its classification ability. Secondly, the number of MCSC is calculated. Thirdly, by comparing the performance of HSC and SVM on corresponding CS, we argue that it is not reasonable that using the same train data set to train different classifiers and then testing the classifiers by the same test data set for different algorithms. The experiments show that algorithms can respectively select the proper data set for training, which ensures good performance and generalization ability. MCSC is the best selection for HSC, and support vector set is the effective selection for SVM.
Keywords
pattern classification; support vector machines; HSC; SVM; consaistent subsets selection criteria; data selection criteria; generalization ability; hyper surface classification; minimal consistent subset for a disjoint cover set; Computers; Databases; Helium; Information processing; Laboratories; Nearest neighbor searches; Neural networks; Support vector machine classification; Support vector machines; Testing; Consistent Subsets (CS); Data Selection Criteria; Minimal Consistent Subset for a disjoint Cover set;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Computation, 2008. ICNC '08. Fourth International Conference on
Conference_Location
Jinan
Print_ISBN
978-0-7695-3304-9
Type
conf
DOI
10.1109/ICNC.2008.334
Filename
4667166
Link To Document