• DocumentCode
    2436636
  • Title

    Distributed and Parallelled EM Algorithm for Distributed Cluster Ensemble

  • Author

    Wang, Hongjun ; Li, Zhishu ; Cheng, Yang

  • Author_Institution
    Sch. of Comput. Sci., Sichuan Univ., Chengdu
  • Volume
    2
  • fYear
    2008
  • fDate
    19-20 Dec. 2008
  • Firstpage
    3
  • Lastpage
    8
  • Abstract
    The paper introduces base clusterings distributed cluster ensemble which can handle the problems of privacy preservation, distributed computing and knowledge reuse. First, the latent variables in latent Dirichlet location model for cluster ensemble (LDA-CE) are defined and some terminologies are defined. Second, Variational approximation inference for LDA-CE is stated in detail. Third, base on the variational approximation inference, we design a distributed and paralleled EM algorithm for cluster ensemble (DPEM). Finally, some datasets from UCI are chosen for experiment, Compared with cluster-based similarity partitioning algorithm (CSPA), hyper-graph partitioning algorithm(HGPA) and meta-clustering algorithm(MCLA), the results show DPEM algorithm does work better and DPEM can work distributed and paralleled, so DPEM can protect privacy information more and can save time.
  • Keywords
    data privacy; data visualisation; expectation-maximisation algorithm; inference mechanisms; parallel algorithms; pattern clustering; data visualization; distributed cluster ensemble; distributed computing; knowledge reuse; latent Dirichlet location model; parallelled EM algorithm; privacy preservation; variational approximation inference; Clustering algorithms; Computer industry; Computer science; Data privacy; Distributed computing; Inference algorithms; Machine learning algorithms; Matrix converters; Partitioning algorithms; Robust stability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Industrial Application, 2008. PACIIA '08. Pacific-Asia Workshop on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-0-7695-3490-9
  • Type

    conf

  • DOI
    10.1109/PACIIA.2008.346
  • Filename
    4756723