• DocumentCode
    567056
  • Title

    Distrim: Parallel GMM learning on multicore cluster

  • Author

    Yang, Renyong ; Xiong, Tengke ; Chen, Tao ; Huang, Zhexue ; Feng, Shengzhong

  • Author_Institution
    Shenzhen Inst. of Adv. Technol., Shenzhen, China
  • Volume
    2
  • fYear
    2012
  • fDate
    25-27 May 2012
  • Firstpage
    630
  • Lastpage
    635
  • Abstract
    Learning GMM model on extreme large data is challenging. We provide theoretical support for the feasibility of parallel EM-based GMM learning via distributed computing, and also design and implement a distributed memory sharing GMM learning system on multicore clusters, which is named as Distrim. Distrim aims to maximize the usage of computational power and minimize the communication overheads as much as possible. The experimental results show that Distrim is much more efficient than Hadoop, and also has a good scalability with respect to the number of computing nodes.
  • Keywords
    Gaussian Mixture Model; MPI; distributed computing; memory sharing; parallel learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Automation Engineering (CSAE), 2012 IEEE International Conference on
  • Conference_Location
    Zhangjiajie, China
  • Print_ISBN
    978-1-4673-0088-9
  • Type

    conf

  • DOI
    10.1109/CSAE.2012.6272849
  • Filename
    6272849