DocumentCode :
3128768
Title :
Drift Detection Using Uncertainty Distribution Divergence
Author :
Lindstrom, Patrick ; Namee, Brian Mac ; Delany, Sarah Jane
Author_Institution :
Sch. of Comput., Dublin Inst. of Technol., Dublin, Ireland
fYear :
2011
fDate :
11-11 Dec. 2011
Firstpage :
604
Lastpage :
608
Abstract :
Concept drift is believed to be prevalent in most data gathered from naturally occurring processes and thus warrants research by the machine learning community. There are a myriad of approaches to concept drift handling which have been shown to handle concept drift with varying degrees of success. However, most approaches make the key assumption that the labelled data will be available at no labelling cost shortly after classification, an assumption which is often violated. The high labelling cost in many domains provides a strong motivation to reduce the number of labelled instances required to handle concept drift. Explicit detection approaches that do not require labelled instances to detect concept drift show great promise for achieving this. Our approach Confidence Distribution Batch Detection (CDBD) provides a signal correlated to changes in concept without using labelled data. We also show how this signal combined with a trigger and a rebuild policy can maintain classifier accuracy while using a limited amount of labelled data.
Keywords :
data handling; learning (artificial intelligence); CDBD; concept drift; confidence distribution batch detection; drift detection; explicit detection; machine learning; uncertainty distribution divergence; Accuracy; Conferences; Data mining; Labeling; Machine learning; Training; Training data; classifier confidence; concept drift; explicit drift detection; labelling cost;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4673-0005-6
Type :
conf
DOI :
10.1109/ICDMW.2011.70
Filename :
6137435
Link To Document :
بازگشت