• DocumentCode
    3666715
  • Title

    Statistical estimation for Single Linkage Hierarchical Clustering

  • Author

    Dekang Zhu;Dan Guralnik;Xuezhi Wang;Xiang Li;Bill Moran

  • Author_Institution
    School of Electronic Science and Engineering, National University of Defense Technology, Changsha, China
  • fYear
    2015
  • fDate
    6/1/2015 12:00:00 AM
  • Firstpage
    745
  • Lastpage
    750
  • Abstract
    Clustering is a key component of most detectors of cyber-attacks, and increasingly, for both theoretical and practical reasons, methods that produce hierarchical clusterings (dendrograms) are being deployed in this context. In particular Single Linkage Hierarchical Clustering (SLHC) is attracting considerable interest. Existing clustering algorithms take no account of uncertainties in the data. In this paper, we derive a statistical model for the estimation of dendrograms, taking into account the uncertainty (through noise or corruption) in the distances among data points. We focus on just the estimation of the hierarchy of partitions afforded by the dendrogram, rather than the heights in the latter. The concept of estimating this "dendrogram structure" under SLHC is introduced, and an approximate maximum likelihood estimator (MLE) for the dendrogram structure is described. The proposed concept is demonstrated by a simple Monte Carlo simulation which demonstrates that the proposed MLE method performs better than SLHC in obtaining the correct structure.
  • Keywords
    "Measurement","Maximum likelihood estimation","Couplings","Clustering algorithms","Monte Carlo methods","Clustering methods"
  • Publisher
    ieee
  • Conference_Titel
    Cyber Technology in Automation, Control, and Intelligent Systems (CYBER), 2015 IEEE International Conference on
  • Print_ISBN
    978-1-4799-8728-3
  • Type

    conf

  • DOI
    10.1109/CYBER.2015.7288035
  • Filename
    7288035