DocumentCode :
2689862
Title :
Data stream clustering and modeling using context-trees
Author :
Jiang, Wei ; Brice, Pierre
Author_Institution :
Dept. of Ind. Eng. & Logistics Manage., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
fYear :
2009
fDate :
8-10 June 2009
Firstpage :
932
Lastpage :
937
Abstract :
Many applications such as telecommunication and commercial video broadcasting streams, computer systems logs, and web clicks are categorical or mixed-value data streams that exhibit context-dependency. Models that try to capture this context-dependency tend not to be scalable. This paper offers a solution to the scalability problem of these models by providing a method for generating them around relevant aggregates of these data streams rather than the individual samples. The approach expands existing clustering techniques for static categorical data sets to predictive models of data streams based on Variable Length Markov models of clusters. The paper includes theoretical and experimental evaluations of the technique as well as comparison with other prominent clustering techniques for categorical data streams.
Keywords :
Markov processes; statistical analysis; categorical data streams; context-trees; data stream clustering; variable length Markov models; Aggregates; Application software; Broadcasting; Clustering algorithms; Context modeling; Multimedia communication; Predictive models; Probability distribution; Statistical distributions; Streaming media; Anomaly Detection; Markov Chains; Trend Analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Service Systems and Service Management, 2009. ICSSSM '09. 6th International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4244-3661-3
Electronic_ISBN :
978-1-4244-3662-0
Type :
conf
DOI :
10.1109/ICSSSM.2009.5175016
Filename :
5175016
Link To Document :
بازگشت