• DocumentCode
    2195106
  • Title

    Clustering Performance on Evolving Data Streams: Assessing Algorithms and Evaluation Measures within MOA

  • Author

    Kranen, Philipp ; Kremer, Hardy ; Jansen, Timm ; Seidl, Thomas ; Bifet, Albert ; Holmes, Graham ; Pfahringer, Bernhard

  • Author_Institution
    Data Manage. & Data Exploration Group, RWTH Aachen Univ., Aachen, Germany
  • fYear
    2010
  • fDate
    13-13 Dec. 2010
  • Firstpage
    1400
  • Lastpage
    1403
  • Abstract
    In today´s applications, evolving data streams are ubiquitous. Stream clustering algorithms were introduced to gain useful knowledge from these streams in real-time. The quality of the obtained clusterings, i.e. how good they reflect the data, can be assessed by evaluation measures. A multitude of stream clustering algorithms and evaluation measures for clusterings were introduced in the literature, however, until now there is no general tool for a direct comparison of the different algorithms or the evaluation measures. In our demo, we present a novel experimental framework for both tasks. It offers the means for extensive evaluation and visualization and is an extension of the Massive Online Analysis (MOA) software environment released under the GNU GPL License.
  • Keywords
    data handling; pattern clustering; GNU GPL license; MOA; assessing algorithm; data visualization; evaluation measure; evolving data stream; massive online analysis; stream clustering; clustering; data streams; evaluation measures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2010 IEEE International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-1-4244-9244-2
  • Electronic_ISBN
    978-0-7695-4257-7
  • Type

    conf

  • DOI
    10.1109/ICDMW.2010.17
  • Filename
    5693462