DocumentCode
948395
Title
Maxi–Min Margin Machine: Learning Large Margin Classifiers Locally and Globally
Author
Huang, Kaizhu ; Yang, Haiqin ; King, Irwin ; Lyu, Michael R.
Author_Institution
Fujitsu R&D Center Co. Ltd., Beijing
Volume
19
Issue
2
fYear
2008
Firstpage
260
Lastpage
272
Abstract
In this paper, we propose a novel large margin classifier, called the maxi-min margin machine (M4). This model learns the decision boundary both locally and globally. In comparison, other large margin classifiers construct separating hyperplanes only either locally or globally. For example, a state-of-the-art large margin classifier, the support vector machine (SVM), considers data only locally, while another significant model, the minimax probability machine (MPM), focuses on building the decision hyperplane exclusively based on the global information. As a major contribution, we show that SVM yields the same solution as M4 when data satisfy certain conditions, and MPM can be regarded as a relaxation model of M4. Moreover, based on our proposed local and global view of data, another popular model, the linear discriminant analysis, can easily be interpreted and extended as well. We describe the M4 model definition, provide a geometrical interpretation, present theoretical justifications, and propose a practical sequential conic programming method to solve the optimization problem. We also show how to exploit Mercer kernels to extend M4 for nonlinear classifications. Furthermore, we perform a series of evaluations on both synthetic data sets and real-world benchmark data sets. Comparison with SVM and MPM demonstrates the advantages of our new model.
Keywords
boundary-value problems; computational geometry; learning (artificial intelligence); minimax techniques; nonlinear programming; pattern classification; probability; relaxation theory; support vector machines; Mercer kernels; decision boundaries; geometrical interpretation; large margin classifier learning; linear discriminant analysis; maxi-min margin machine; minimax probability machine; nonlinear classification; optimization problem; relaxation model; sequential conic programming method; support vector machines; Classification; kernel methods; large margin; learning locally and globally; second-order cone programming; Algorithms; Artificial Intelligence; Cluster Analysis; Neural Networks (Computer); Pattern Recognition, Automated;
fLanguage
English
Journal_Title
Neural Networks, IEEE Transactions on
Publisher
ieee
ISSN
1045-9227
Type
jour
DOI
10.1109/TNN.2007.905855
Filename
4359216
Link To Document