• DocumentCode
    3261068
  • Title

    Clustering-training for Data Stream Mining

  • Author

    Wu, Shuang ; Yang, Chunyu ; Zhou, Jie

  • Author_Institution
    Dept. of Autom., Tsinghua Univ., Beijing
  • fYear
    2006
  • fDate
    Dec. 2006
  • Firstpage
    653
  • Lastpage
    656
  • Abstract
    Mining data streams has attracted much attention recently. Labeled samples needed by most current stream classification methods are more difficult and expensive to obtain than unlabeled ones. This paper proposed a semi-supervised learning algorithm - clustering-training to utilize the unlabeled samples. It uses clustering to select confidently unlabeled samples, and uses them to re-train the classifier incrementally. Experiments on synthetic and real data set showed the effectiveness of the proposed algorithm
  • Keywords
    data mining; learning (artificial intelligence); pattern clustering; clustering-training; data set; data stream mining; semisupervised learning; unlabeled samples; Automation; Clustering algorithms; Data mining; Data processing; Databases; Labeling; Large-scale systems; Sampling methods; Semisupervised learning; Supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    0-7695-2702-7
  • Type

    conf

  • DOI
    10.1109/ICDMW.2006.45
  • Filename
    4063706