Title :
Using Boosting and Clustering to Prune Bagging and Detect Noisy Data
Author :
Xie, Yuan Cheng ; Yang, Jing Yu
Author_Institution :
Sch. of Comput. Sci. & Technol., Nanjing Univ. of Sci. & Technol., Nanjing, China
Abstract :
AdaBoost has been the representation of ensemble learning algorithm because of its excellent performance. However, due to its longtime training, AdaBoost was complained about by people and this defect limits the practical application. Bagging is a rapid method of training and supports for parallel computing. One of important factors that can affect the performance of ensemble learning is the diversity of component learners. Based on this view, a new algorithm using clustering and boosting to prune Bagging ensembles is proposed in this paper. Its learning efficiency is close to Bagging and its performance is close to AdaBoost. Furthermore, this new algorithm can detect noisy data from original samples based on cascade technique, and a better result of noise detection can be acquired.
Keywords :
learning (artificial intelligence); pattern clustering; AdaBoost; data clustering; ensemble learning; noisy data detection; parallel computing; prune bagging; Bagging; Boosting; Clustering algorithms; Computer science; Educational institutions; Information science; Neural networks; Parallel processing; Reactive power;
Conference_Titel :
Pattern Recognition, 2009. CCPR 2009. Chinese Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-4199-0
DOI :
10.1109/CCPR.2009.5344126