DocumentCode
3261068
Title
Clustering-training for Data Stream Mining
Author
Wu, Shuang ; Yang, Chunyu ; Zhou, Jie
Author_Institution
Dept. of Autom., Tsinghua Univ., Beijing
fYear
2006
fDate
Dec. 2006
Firstpage
653
Lastpage
656
Abstract
Mining data streams has attracted much attention recently. Labeled samples needed by most current stream classification methods are more difficult and expensive to obtain than unlabeled ones. This paper proposed a semi-supervised learning algorithm - clustering-training to utilize the unlabeled samples. It uses clustering to select confidently unlabeled samples, and uses them to re-train the classifier incrementally. Experiments on synthetic and real data set showed the effectiveness of the proposed algorithm
Keywords
data mining; learning (artificial intelligence); pattern clustering; clustering-training; data set; data stream mining; semisupervised learning; unlabeled samples; Automation; Clustering algorithms; Data mining; Data processing; Databases; Labeling; Large-scale systems; Sampling methods; Semisupervised learning; Supervised learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
Conference_Location
Hong Kong
Print_ISBN
0-7695-2702-7
Type
conf
DOI
10.1109/ICDMW.2006.45
Filename
4063706
Link To Document