Title :
Expected Classification Error of the Euclidean Linear Classifier under Sudden Concept Drift
Author :
Zliobaite, Indre
Author_Institution :
Fac. of Math. & Inf., Vilnius Univ., Vilnius
Abstract :
We look at binary online classification in the light of sudden concept drift (data exhibits non-stationarity). The accuracy of the classifier trained on a mixture of old and new data is compared to the accuracy of the classifier trained only on new data, assuming known point of concept drift. We employ a simplified model of concept drift and derive theoretical generalization error for the Euclidean linear classifier. Right after concept drift the retrained classifier is more accurate than the new classifier, especially in cases when the data is complex (high dimensionality, low separability). The new classifier should be preferred when the extent of drift is very large.
Keywords :
data mining; pattern classification; Euclidean linear classifier; binary online classification; expected classification error; sudden concept drift; Buildings; Finance; Frequency shift keying; Fuzzy systems; Informatics; Mathematics; Numerical analysis; Pattern recognition; Probability density function; Terminology; Concept drift; Euclidean classifier; changing environment; nearest mean classifier;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location :
Shandong
Print_ISBN :
978-0-7695-3305-6
DOI :
10.1109/FSKD.2008.305