DocumentCode :
3139968
Title :
Identifying Structures with Informative Dimensions in Streams
Author :
Bhatnagar, Vasudha ; Kaur, Sharanjit ; Gupta, Neelima
Author_Institution :
Dept. of Comput. Sci., Univ. of Delhi, Delhi, India
fYear :
2009
fDate :
1-3 June 2009
Firstpage :
375
Lastpage :
382
Abstract :
Discovering structures in streaming data is an important data mining task and has motivated the design of several well-known algorithms. However, in some applications, a higher level of analysis is desirable to reveal the set of dimensions that contribute heavily to the structures. In this paper, we propose an algorithm, ISID (Identifying Structures with Informative Dimensions), which operates in the streaming environment and delivers clusters along with the dimensions that contribute significantly to these clusters. The algorithm uses a three-stage approach and utilizes entropy in an innovative way to achieve the goal in four different ways, depending on the desired guarantees on structural richness or minimal dimension set for a cluster. The experimental results on synthetic and real data sets demonstrate the efficiency and effectiveness of the proposed algorithm.
Keywords :
data mining; data structures; entropy; pattern clustering; data clustering; data streaming structure discovering; data structure; entropy; informative dimension; Algorithm design and analysis; Application software; Clustering algorithms; Computer science; Data mining; Entropy; Information science; Monitoring; Statistical distributions; Telephony; Clustering; Data streams; Entropy;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer and Information Science, 2009. ICIS 2009. Eighth IEEE/ACIS International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3641-5
Type :
conf
DOI :
10.1109/ICIS.2009.95
Filename :
5222904