Title :
A DCT based approach for detecting novelty and concept drift in data streams
Author :
Hayat, Morteza Zi ; Hashemi, Mahmoud Reza
Author_Institution :
Sch. of Electr. & Comput. Eng., Univ. of Tehran, Tehran, Iran
Abstract :
Data streams are one of the most challenging environments for machine learning. In many applications, the high volume data streams have an inherent concept drift over time. Identifying novel classes and detecting the occurrence of concept drift in such an environment is a major challenge. In this paper, a new method has been proposed to detect novelty and handle concept drift with limited required memory and storage space. The method is based on clustering algorithm. It uses Discrete Cosine Transform to build compact generative models which are then used to detect novel classes and concept drift effectively. The proposed method has been evaluated with seven common data sets from various domains. The results indicate its superior performance when compared with existing methods in terms of novelty and drift detection, computational complexity and memory requirements.
Keywords :
computational complexity; discrete cosine transforms; learning (artificial intelligence); pattern clustering; DCT; clustering algorithm; compact generative model; computational complexity; concept drift; data stream; discrete cosine transform; machine learning; novelty detection; Clustering algorithms; Computational modeling; Data models; Discrete cosine transforms; Fitting; Memory management; Streaming media; Classification; Clustering; Concept drift; Data stream; Novelty detection;
Conference_Titel :
Soft Computing and Pattern Recognition (SoCPaR), 2010 International Conference of
Conference_Location :
Paris
Print_ISBN :
978-1-4244-7897-2
DOI :
10.1109/SOCPAR.2010.5686734