Title :
The Relationship between Generalization Error and the Training Sample Number of SVM
Author :
Bai, Junqing ; Yan, Guirong ; Mao, Wentao
Author_Institution :
Key Lab. of Strength & Vibration of Minist. of Educ., Xi´´an Jiaotong Univ., Xi´´an, China
Abstract :
It is very important to construct the training set and determine the sample number in the regression problem. In this paper, a new idea of constructing the training set is elaborated. The key point of this idea is to choose the hyper-parameters before determining the training set. More importantly, a heuristic approach is proposed to select samples of support vector machine (SVM). Using these methods, the relationship between generalization error and the number of training samples on a given confidence level is computed. The empirical results on benchmark data (Boston Housing) and engineering data indicate that the proposed approach can give a reference to construct the proper training set. Moreover, the proposed approach has practical significance for other parametric learning machine.
Keywords :
generalisation (artificial intelligence); learning (artificial intelligence); regression analysis; support vector machines; SVM training sample; generalization error; heuristic approach; parametric learning machine; regression problem; sample number; support vector machine samples selection; training set construction; Accuracy; Benchmark testing; Data engineering; Laboratories; Least squares methods; Machine learning; Predictive models; Support vector machines; System performance; System testing; Generalization Error; Training Sample; support vector machine;
Conference_Titel :
Natural Computation, 2009. ICNC '09. Fifth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3736-8
DOI :
10.1109/ICNC.2009.479