• DocumentCode
    3139968
  • Title

    Identifying Structures with Informative Dimensions in Streams

  • Author

    Bhatnagar, Vasudha ; Kaur, Sharanjit ; Gupta, Neelima

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Delhi, Delhi, India
  • fYear
    2009
  • fDate
    1-3 June 2009
  • Firstpage
    375
  • Lastpage
    382
  • Abstract
    Discovering structures in streaming data is an important data mining task and has motivated design of several well known algorithms. However, in some applications, a higher level of analysis is desirable to reveal the set of dimensions which contribute heavily to the structures. In this paper, we propose an algorithm ISID (identifying structures with informative dimensions), which operates in the streaming environment and delivers clusteres along with dimensions that contribute significantly to these clusters. The algorithm uses a three stage approach and utilizes entropy in an innovative way to achieve the goal in four different ways, depending on the desired guarantees on structural richness or minimal dimension set for a cluster.The experimental results on synthetic and real data sets demonstrate the efficiency and effectiveness of the proposed algorithm.
  • Keywords
    data mining; data structures; entropy; pattern clustering; data clustering; data streaming structure discovering; data structure; entropy; informative dimension; Algorithm design and analysis; Application software; Clustering algorithms; Computer science; Data mining; Entropy; Information science; Monitoring; Statistical distributions; Telephony; Clustering; Data streams; Entropy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Science, 2009. ICIS 2009. Eighth IEEE/ACIS International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3641-5
  • Type

    conf

  • DOI
    10.1109/ICIS.2009.95
  • Filename
    5222904