• DocumentCode
    2000397
  • Title

    Pre-clustering algorithm for anomaly detection and clustering that uses variable size buckets

  • Author

    Sharma, Manish ; Toshniwal, Durga

  • Author_Institution
    Electron. & Comput. Eng., Indian Inst. of Technol. Roorkee, Roorkee, India
  • fYear
    2012
  • fDate
    15-17 March 2012
  • Firstpage
    515
  • Lastpage
    519
  • Abstract
    Clustering is known as grouping of data based on their similarities. This paper introduces an algorithm of k means for clustering of data streams and detection of outliers. The introduced technique for detection of outliers is based on distance as well as on time on which they arrive in the cluster. This paper also takes into account the selection of k centers and variable size of buckets with the help of which space can be effectively utilized during clustering. Most traditional algorithms make clustering a very difficult problem by reducing their quality for a better efficiency. This paper indicates that with a small increase in time you can efficiently cluster the data without much loss of quality of data.
  • Keywords
    pattern clustering; anomaly detection; data grouping; data quality; data streams; outlier detection; preclustering algorithm; variable size buckets; Algorithm design and analysis; Clustering algorithms; Data mining; Heuristic algorithms; Information technology; Intrusion detection; Iris; anomaly detection; boolean data; categorial data; clustering; k means;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Recent Advances in Information Technology (RAIT), 2012 1st International Conference on
  • Conference_Location
    Dhanbad
  • Print_ISBN
    978-1-4577-0694-3
  • Type

    conf

  • DOI
    10.1109/RAIT.2012.6194613
  • Filename
    6194613