• DocumentCode
    2710009
  • Title

    Evolutionary Clustering by Hierarchical Dirichlet Process with Hidden Markov State

  • Author

    Tianbing Xu ; Zhongfei Zhang ; Yu, P.S. ; Long, Brenda

  • Author_Institution
    Dept. of Comput. Sci., State Univ. of New York at Binghamton, Binghamton, NY
  • fYear
    2008
  • fDate
    15-19 Dec. 2008
  • Firstpage
    658
  • Lastpage
    667
  • Abstract
    This paper studies evolutionary clustering, which is a recently hot topic with many important applications, noticeably in social network analysis. In this paper, based on the recent literature on Hierarchical Dirichlet Process (HDP) and Hidden Markov Model (HMM), we have developed a statistical model HDP-HTM that combines HDP with a Hierarchical Transition Matrix (HTM) based on the proposed Infinite Hierarchical Hidden Markov State model (iH2MS) as an effective solution to this problem. The HDP-HTM model substantially advances the literature on evolutionary clustering in the sense that not only it performs better than the existing literature, but more importantly it is capable of automatically learning the cluster numbers and structures and at the same time explicitly addresses the correspondence issue during the evolution. Extensive evaluations have demonstrated the effectiveness and promise of this solution against the state-of-the-art literature.
  • Keywords
    data mining; hidden Markov models; matrix algebra; data mining; evolutionary clustering; hierarchical Dirichlet process; hierarchical transition matrix; infinite hierarchical hidden Markov state model; social network analysis; Application software; Computer science; Data mining; Hidden Markov models; Information services; Internet; Social network services; Time sharing computer systems; USA Councils; Web sites; Evolutionary Clustering; HDP-HTM; Hierarchical Dirichlet Process; Infinite Hidden Markov Model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
  • Conference_Location
    Pisa
  • ISSN
    1550-4786
  • Print_ISBN
    978-0-7695-3502-9
  • Type

    conf

  • DOI
    10.1109/ICDM.2008.24
  • Filename
    4781161