• DocumentCode
    399776
  • Title

    TECNO-STREAMS: tracking evolving clusters in noisy data streams with a scalable immune system learning model

  • Author

    Nasraoui, Olfa ; Uribe, Cesar Cardona ; Coronel, Carlos Rojas ; Gonzalez, Fabio

  • Author_Institution
    Dept. of Electr. & Comput. Eng., The Univ. of Memphis, TN, USA
  • fYear
    2003
  • fDate
    19-22 Nov. 2003
  • Firstpage
    235
  • Lastpage
    242
  • Abstract
    Artificial immune system (AIS) models hold many promises in the field of unsupervised learning. However, existing models are not scalable, which makes them of limited use in data mining. We propose a new AIS based clustering approach (TECNO-STREAMS) that addresses the weaknesses of current AIS models. Compared to existing AIS based techniques, our approach exhibits superior learning abilities, while at the same time, requiring low memory and computational costs. Like the natural immune system, the strongest advantage of immune based learning compared to other approaches is expected to be its ease of adaptation to the dynamic environment that characterizes several applications, particularly in mining data streams. We illustrate the ability of the proposed approach in detecting clusters in noisy data sets, and in mining evolving user profiles from Web clickstream data in a single pass. TECNO-STREAMS adheres to all the requirements of clustering data streams: compactness of representation, fast incremental processing of new data points, and clear and fast identification of outliers.
  • Keywords
    data mining; pattern clustering; unsupervised learning; AIS; TECNO-STREAMS approach; Web clickstream data; artificial immune system model; cluster detection; data mining; dynamic environment; noisy data set; unsupervised learning; user profile; Artificial immune systems; Cloning; Computational efficiency; Data mining; Euclidean distance; Immune system; Pathogens; Power generation; Proteins; Unsupervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
  • Print_ISBN
    0-7695-1978-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2003.1250925
  • Filename
    1250925