Title :
Bucket Learning: Improving Model Quality through Enhancing Local Patterns
Author :
Qu, Guangzhi ; Wu, Hui
Author_Institution :
Comput. Sci. & Eng. Dept., Oakland Univ., Rochester, MI, USA
Abstract :
It is always desirable to improve the quality of a global classification model with the existence of other models. In this work, bucket learning methodology is first proposed to improve the model quality through enhancing its local patterns. We formally define the concept of slab as a tri-tuple <D, M, R>, which unifies the data view, model view and evaluation view of a data mining task. The bucket learning framework includes main modules of slab generation, short slab discovery, and short slab replacement as necessary steps to improve the model´s quality. Algorithms are designed to facilate the operations of quantifying the model merits, identifying the inferior local patterns and improving the global model. A prototype system is developed to verify the proposed methodology. The bucket learning prototype system is evaluated on 16 representative data sets from UCI data repository. Experimental results show that the improved model have an averaged F-measure of 79.5% with an improvement of 7.3% from the original model learned by J48.
Keywords :
data mining; learning (artificial intelligence); F-measure; bucket learning; data mining; global classification model; local pattern enhancement; model quality; short slab discovery; short slab replacement; slab generation; Computer science; Conferences; Data mining; Data privacy; Detection algorithms; Distributed algorithms; Monitoring; NASA; Space technology; Statistical distributions;
Conference_Titel :
Data Mining Workshops, 2009. ICDMW '09. IEEE International Conference on
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4244-5384-9
Electronic_ISBN :
978-0-7695-3902-7
DOI :
10.1109/ICDMW.2009.66