Clustering-training for Data Stream Mining

Author

Wu, Shuang ; Yang, Chunyu ; Zhou, Jie

Author_Institution

Dept. of Autom., Tsinghua Univ., Beijing

fYear

2006

fDate

Dec. 2006

Firstpage

653

Lastpage

656

Abstract

Mining data streams has attracted much attention recently. Labeled samples needed by most current stream classification methods are more difficult and expensive to obtain than unlabeled ones. This paper proposed a semi-supervised learning algorithm - clustering-training to utilize the unlabeled samples. It uses clustering to select confidently unlabeled samples, and uses them to re-train the classifier incrementally. Experiments on synthetic and real data set showed the effectiveness of the proposed algorithm

Keywords

data mining; learning (artificial intelligence); pattern clustering; clustering-training; data set; data stream mining; semisupervised learning; unlabeled samples; Automation; Clustering algorithms; Data mining; Data processing; Databases; Labeling; Large-scale systems; Sampling methods; Semisupervised learning; Supervised learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on

Conference_Location

Hong Kong

Print_ISBN

0-7695-2702-7

Type

conf

DOI

10.1109/ICDMW.2006.45

Filename

4063706

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3261068