DocumentCode
60713
Title
Multiobjective Optimization for Model Selection in Kernel Methods in Regression
Author
Di You ; Benitez-Quiroz, Carlos Fabian ; Martinez, Ana Milena
Author_Institution
Motorola Mobility, Libertyville, IL, USA
Volume
25
Issue
10
fYear
2014
fDate
Oct. 2014
Firstpage
1879
Lastpage
1893
Abstract
Regression plays a major role in many scientific and engineering problems. The goal of regression is to learn the unknown underlying function from a set of sample vectors with known outcomes. In recent years, kernel methods in regression have facilitated the estimation of nonlinear functions. However, two major (interconnected) problems remain open. The first problem is given by the bias-versus-variance tradeoff. If the model used to estimate the underlying function is too flexible (i.e., high model complexity), the variance will be very large. If the model is fixed (i.e., low complexity), the bias will be large. The second problem is to define an approach for selecting the appropriate parameters of the kernel function. To address these two problems, this paper derives a new smoothing kernel criterion, which measures the roughness of the estimated function as a measure of model complexity. Then, we use multiobjective optimization to derive a criterion for selecting the parameters of that kernel. The goal of this criterion is to find a tradeoff between the bias and the variance of the learned function. That is, the goal is to increase the model fit while keeping the model complexity in check. We provide extensive experimental evaluations using a variety of problems in machine learning, pattern recognition, and computer vision. The results demonstrate that the proposed approach yields smaller estimation errors as compared with methods in the state of the art.
Keywords
nonlinear estimation; nonlinear functions; optimisation; regression analysis; bias-versus-variance tradeoff; computer vision; estimated function; estimation error; experimental evaluation; kernel function; kernel methods; machine learning; model complexity; model selection; multiobjective optimization; nonlinear function estimation; pattern recognition; regression; smoothing kernel criterion; Complexity theory; Computational modeling; Kernel; Noise; Optimization; Training; Vectors; Kernel methods; Pareto optimality; kernel optimization; optimization; regression; regression.;
fLanguage
English
Journal_Title
Neural Networks and Learning Systems, IEEE Transactions on
Publisher
ieee
ISSN
2162-237X
Type
jour
DOI
10.1109/TNNLS.2013.2297686
Filename
6712159
Link To Document