Title :
Manipulation of prior probabilities in support vector classification
Author :
Cawley, Gavin C. ; Talbot, Nicola L C
Author_Institution :
Sch. of Inf. Syst., East Anglia Univ., Norwich, UK
Abstract :
Asymmetric margin error costs for positive and negative examples are often cited as an efficient heuristic compensating for unrepresentative priors in training support vector classifiers. In this paper we show that this heuristic is well justified via simple re-sampling ideas applied to the dual Lagrangian defining the 1-norm soft-margin support vector machine. This observation also provides a simple expression for the asymptotically optimal ratio of margin error penalties, eliminating the need for the trial-and-error experimentation normally encountered. This method allows the use of a smaller, balanced training data set in problems characterised by widely disparate prior probabilities, reducing the training time. The usefulness of this method is then demonstrated on a real world benchmark problem
Keywords :
learning (artificial intelligence); learning automata; neural nets; optimisation; pattern classification; probability; heuristics; learning time; margin error; pattern classification; probability; support vector classification; support vector machine; Cardiac disease; Costs; Frequency; Information systems; Lagrangian functions; Pattern recognition; Probability; Support vector machine classification; Support vector machines; Training data;
Conference_Titel :
Neural Networks, 2001. Proceedings. IJCNN '01. International Joint Conference on
Conference_Location :
Washington, DC
Print_ISBN :
0-7803-7044-9
DOI :
10.1109/IJCNN.2001.938748