DocumentCode
182992
Title
Distributed clustering using distributed mixture of probabilistic PCA
Author
Hang Yin ; Chunhong Zhang ; Yang Ji
Author_Institution
Sch. of Inf. & Commun. Eng., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2014
fDate
19-21 Aug. 2014
Firstpage
352
Lastpage
357
Abstract
This paper considers the clustering algorithm based on mixture of probabilistic principle component analyzers in distributed environments. The EM procedure of it is first transformed to a summing variant. Following the classic distributed EM framework for mixture of exponential family distributions and utilizing the summing variant, we propose the distributed EM for mixture of probabilistic principle component analyzers. The proposed algorithm avoids transferring all data from distributed nodes to a central node. Experiment verifies the validity and feasibility of the proposed method. For some datasets, the proposed method can even enhance the log-likelihood as well as the clustering performance.
Keywords
data mining; distributed processing; mixture models; pattern clustering; principal component analysis; statistical distributions; central node; clustering performance enhancement; distributed EM framework; distributed clustering algorithm; distributed mixture; distributed nodes; exponential family distributions; log-likelihood enhancement; probabilistic PCA; probabilistic principle component analyzers; summing variant; Clustering algorithms; Convergence; Data models; Distributed databases; Principal component analysis; Probabilistic logic; Vectors; Distributed Clustering; Distributed Data Mining; Distributed EM; Mixture of Probabilistic PCA;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
Conference_Location
Xiamen
Print_ISBN
978-1-4799-5147-5
Type
conf
DOI
10.1109/FSKD.2014.6980859
Filename
6980859
Link To Document