• DocumentCode
    532244
  • Title

    Classifying skewed data streams based on reusing data

  • Author

    Liu, Peng ; Wang, Yong ; Cai, Lijun ; Zhang, Longbo

  • Author_Institution
    Sch. of Sci., Northwestern Polytech. Univ., Xi´´an, China
  • Volume
    4
  • fYear
    2010
  • fDate
    22-24 Oct. 2010
  • Abstract
    Current research community on data streams mining focuses on mining balanced data streams. However, the skewed class distribution appears in many data streams applications. In this paper, we introduce the method of discovering concept drifting on skewed data streams and propose an algorithm for classifying skewed data streams based on reusing data, RDFCSDS (Reuse Data for Classifying Skewed Data Streams). We evaluate RDFCSDS algorithm on Moving Hyperplane data set. The experiment results show that the sampling method based on reusing data works better than the simple sampling method and cluster sampling method on skewed data streams with concept drifting.
  • Keywords
    data mining; pattern classification; concept drifting; data stream mining; hyperplane data set; reuse data for classifying skewed data stream; sampling method; skewed class distribution; Educational institutions; Concept Drifting; Ensemble Classifiers; Skewed Distribution;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Application and System Modeling (ICCASM), 2010 International Conference on
  • Conference_Location
    Taiyuan
  • Print_ISBN
    978-1-4244-7235-2
  • Electronic_ISBN
    978-1-4244-7237-6
  • Type

    conf

  • DOI
    10.1109/ICCASM.2010.5620201
  • Filename
    5620201