مرکز منطقه ای اطلاع رساني علوم و فناوري - Predictive Ensemble Pruning by Expectation Propagation

DocumentCode :

1186169

Title :

Predictive Ensemble Pruning by Expectation Propagation

Author :

Chen, Huanhuan ; Tiho, P. ; Yao, Xin

Author_Institution :

Sch. of Comput. Sci., Univ. of Birmingham, Birmingham

Volume :

Issue :

fYear :

2009

fDate :

7/1/2009 12:00:00 AM

Firstpage :

999

Lastpage :

1013

Abstract :

An ensemble is a group of learners that work together as a committee to solve a problem. The existing ensemble learning algorithms often generate unnecessarily large ensembles, which consume extra computational resource and may degrade the generalization performance. Ensemble pruning algorithms aim to find a good subset of ensemble members to constitute a small ensemble, which saves the computational resource and performs as well as, or better than, the unpruned ensemble. This paper introduces a probabilistic ensemble pruning algorithm by choosing a set of ldquosparserdquo combination weights, most of which are zeros, to prune the ensemble. In order to obtain the set of sparse combination weights and satisfy the nonnegative constraint of the combination weights, a left-truncated, nonnegative, Gaussian prior is adopted over every combination weight. Expectation propagation (EP) algorithm is employed to approximate the posterior estimation of the weight vector. The leave-one-out (LOO) error can be obtained as a by-product in the training of EP without extra computation and is a good indication for the generalization error. Therefore, the LOO error is used together with the Bayesian evidence for model selection in this algorithm. An empirical study on several regression and classification benchmark data sets shows that our algorithm utilizes far less component learners but performs as well as, or better than, the unpruned ensemble. Our results are very competitive compared with other ensemble pruning algorithms.

Keywords :

Bayes methods; Gaussian processes; estimation theory; generalisation (artificial intelligence); learning (artificial intelligence); pattern classification; probability; regression analysis; set theory; Bayesian evidence; Gaussian posterior estimation; computational resource; data set theory; expectation propagation algorithm; generalization performance; leave-one-out error; pattern classification; predictive ensemble learning algorithm; probabilistic ensemble pruning algorithm; regression analysis; sparse combination weight; Machine learning; classification.; ensemble learning; probabilistic algorithms; regression;

fLanguage :

English

Journal_Title :

Knowledge and Data Engineering, IEEE Transactions on

Publisher :

ieee

ISSN :

1041-4347

Type :

jour

DOI :

10.1109/TKDE.2009.62

Filename :

4798164

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1186169