Title :
Distributed clustering using distributed mixture of probabilistic PCA
Author :
Hang Yin ; Chunhong Zhang ; Yang Ji
Author_Institution :
Sch. of Inf. & Commun. Eng., Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
This paper considers the clustering algorithm based on mixture of probabilistic principle component analyzers in distributed environments. The EM procedure of it is first transformed to a summing variant. Following the classic distributed EM framework for mixture of exponential family distributions and utilizing the summing variant, we propose the distributed EM for mixture of probabilistic principle component analyzers. The proposed algorithm avoids transferring all data from distributed nodes to a central node. Experiment verifies the validity and feasibility of the proposed method. For some datasets, the proposed method can even enhance the log-likelihood as well as the clustering performance.
Keywords :
data mining; distributed processing; mixture models; pattern clustering; principal component analysis; statistical distributions; central node; clustering performance enhancement; distributed EM framework; distributed clustering algorithm; distributed mixture; distributed nodes; exponential family distributions; log-likelihood enhancement; probabilistic PCA; probabilistic principle component analyzers; summing variant; Clustering algorithms; Convergence; Data models; Distributed databases; Principal component analysis; Probabilistic logic; Vectors; Distributed Clustering; Distributed Data Mining; Distributed EM; Mixture of Probabilistic PCA;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4799-5147-5
DOI :
10.1109/FSKD.2014.6980859