DocumentCode
532244
Title
Classifying skewed data streams based on reusing data
Author
Liu, Peng ; Wang, Yong ; Cai, Lijun ; Zhang, Longbo
Author_Institution
Sch. of Sci., Northwestern Polytech. Univ., Xi´´an, China
Volume
4
fYear
2010
fDate
22-24 Oct. 2010
Abstract
Current research community on data streams mining focuses on mining balanced data streams. However, the skewed class distribution appears in many data streams applications. In this paper, we introduce the method of discovering concept drifting on skewed data streams and propose an algorithm for classifying skewed data streams based on reusing data, RDFCSDS (Reuse Data for Classifying Skewed Data Streams). We evaluate RDFCSDS algorithm on Moving Hyperplane data set. The experiment results show that the sampling method based on reusing data works better than the simple sampling method and cluster sampling method on skewed data streams with concept drifting.
Keywords
data mining; pattern classification; concept drifting; data stream mining; hyperplane data set; reuse data for classifying skewed data stream; sampling method; skewed class distribution; Educational institutions; Concept Drifting; Ensemble Classifiers; Skewed Distribution;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Application and System Modeling (ICCASM), 2010 International Conference on
Conference_Location
Taiyuan
Print_ISBN
978-1-4244-7235-2
Electronic_ISBN
978-1-4244-7237-6
Type
conf
DOI
10.1109/ICCASM.2010.5620201
Filename
5620201
Link To Document