DocumentCode
3139968
Title
Identifying Structures with Informative Dimensions in Streams
Author
Bhatnagar, Vasudha ; Kaur, Sharanjit ; Gupta, Neelima
Author_Institution
Dept. of Comput. Sci., Univ. of Delhi, Delhi, India
fYear
2009
fDate
1-3 June 2009
Firstpage
375
Lastpage
382
Abstract
Discovering structures in streaming data is an important data mining task and has motivated the design of several well-known algorithms. However, in some applications, a higher level of analysis is desirable to reveal the set of dimensions that contribute heavily to the structures. In this paper, we propose an algorithm, ISID (Identifying Structures with Informative Dimensions), which operates in the streaming environment and delivers clusters along with the dimensions that contribute significantly to these clusters. The algorithm uses a three-stage approach and utilizes entropy in an innovative way to achieve the goal in four different ways, depending on the desired guarantees on structural richness or minimal dimension set for a cluster. The experimental results on synthetic and real data sets demonstrate the efficiency and effectiveness of the proposed algorithm.
Keywords
data mining; data structures; entropy; pattern clustering; data clustering; data streaming structure discovering; data structure; informative dimension; Algorithm design and analysis; Application software; Clustering algorithms; Computer science; Information science; Monitoring; Statistical distributions; Telephony; Clustering; Data streams
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer and Information Science, 2009. ICIS 2009. Eighth IEEE/ACIS International Conference on
Conference_Location
Shanghai
Print_ISBN
978-0-7695-3641-5
Type
conf
DOI
10.1109/ICIS.2009.95
Filename
5222904