• DocumentCode
    2050201
  • Title

    Parallelization of module network structure learning and performance tuning on SMP

  • Author

    Jiang, Hongshan ; Lai, Chunrong ; Chen, Wenguang ; Chen, Yurong ; Hu, Wei ; Zheng, Weimin ; Zhang, Yimin

  • Author_Institution
    Dept. of Comput. Sci., Tsinghua Univ., Beijing, China
  • fYear
    2006
  • fDate
    25-29 April 2006
  • Abstract
    As an extension of Bayesian network, module network is an appropriate model for inferring causal network of a mass of variables from insufficient evidences. However learning such a model is still a time-consuming process. In this paper, we propose a parallel implementation of module network learning algorithm using OpenMP. We propose a static task partitioning strategy which distributes sub-search-spaces over worker threads to get the tradeoff between load-balance and software-cache-contention. To overcome performance penalties derived from shared-memory contention, we adopt several optimization techniques such as memory pre-allocation, memory alignment and static function usage. These optimizations have different patterns of influence on the sequential performance and the parallel speedup. Experiments validate the effectiveness of these optimizations. For a 2,200 nodes dataset, they enhance the parallel speedup up to 88%, together with a 2X sequential performance improvement. With resource contentions reduced, workload imbalance becomes the main hurdle to parallel scalability and the program behaviors more stable in various platforms.
  • Keywords
    belief networks; learning (artificial intelligence); multiprocessing systems; parallel processing; Bayesian network; OpenMP; load-balancing; memory alignment; memory preallocation; module network learning algorithm; module network structure learning; optimization; parallel speedup; performance tuning; sequential performance improvement; shared-memory contention; software-cache-contention; static function usage; static task partitioning strategy; workload imbalance; Bayesian methods; Bioinformatics; Computer science; Multiprocessing systems; Partitioning algorithms; Scalability; Speech processing; Stochastic processes; Text mining; Yarn;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2006. IPDPS 2006. 20th International
  • Print_ISBN
    1-4244-0054-6
  • Type

    conf

  • DOI
    10.1109/IPDPS.2006.1639610
  • Filename
    1639610