DocumentCode
567056
Title
Distrim: Parallel GMM learning on multicore cluster
Author
Yang, Renyong ; Xiong, Tengke ; Chen, Tao ; Huang, Zhexue ; Feng, Shengzhong
Author_Institution
Shenzhen Inst. of Adv. Technol., Shenzhen, China
Volume
2
fYear
2012
fDate
25-27 May 2012
Firstpage
630
Lastpage
635
Abstract
Learning GMM model on extreme large data is challenging. We provide theoretical support for the feasibility of parallel EM-based GMM learning via distributed computing, and also design and implement a distributed memory sharing GMM learning system on multicore clusters, which is named as Distrim. Distrim aims to maximize the usage of computational power and minimize the communication overheads as much as possible. The experimental results show that Distrim is much more efficient than Hadoop, and also has a good scalability with respect to the number of computing nodes.
Keywords
Gaussian Mixture Model; MPI; distributed computing; memory sharing; parallel learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Automation Engineering (CSAE), 2012 IEEE International Conference on
Conference_Location
Zhangjiajie, China
Print_ISBN
978-1-4673-0088-9
Type
conf
DOI
10.1109/CSAE.2012.6272849
Filename
6272849
Link To Document