• DocumentCode
    3301068
  • Title

    Distributed Latent Dirichlet allocation for objects-distributed cluster ensemble

  • Author

    Wang, Hongjun ; Li, Zhishu ; Cheng, Yang

  • Author_Institution
    Sch. of Comput. Sci., Sichuan Univ., Chengdu
  • fYear
    2008
  • fDate
    19-22 Oct. 2008
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    The paper introduces the model of distributed latent Dirichlet location (D-LDA) for objects-distributed cluster ensemble which can handle the problems of privacy preservation, distributed computing and knowledge reuse. First, the latent variables in D-LDA and some terminologies are defined for cluster ensemble. Second, Markov chain Monte Carlo (MCMC) approximation inference for D-LDA is stated in detail. Third, some datasets from UCI are chosen for experiment, Compared with cluster-based similarity partitioning algorithm (CSPA), hyper-graph partitioning algorithm (HGPA) and meta-clustering algorithm (MCLA), the results show D-LDA does work better, furthermore the outputs of D-LDA, as a soft cluster model, can not only cluster the data points but also show the structure of data points.
  • Keywords
    Markov processes; Monte Carlo methods; approximation theory; data privacy; distributed processing; pattern clustering; Markov chain Monte Carlo approximation inference; UCI; distributed computing; distributed latent Dirichlet allocation; knowledge reusing; objects-distributed cluster ensemble; privacy preservation; soft cluster model; Clustering algorithms; Computer science; Data mining; Data privacy; Distributed computing; Inference algorithms; Machine learning algorithms; Partitioning algorithms; Robust stability; Terminology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-4515-8
  • Electronic_ISBN
    978-1-4244-2780-2
  • Type

    conf

  • DOI
    10.1109/NLPKE.2008.4906792
  • Filename
    4906792