DocumentCode :
2050201
Title :
Parallelization of module network structure learning and performance tuning on SMP
Author :
Jiang, Hongshan ; Lai, Chunrong ; Chen, Wenguang ; Chen, Yurong ; Hu, Wei ; Zheng, Weimin ; Zhang, Yimin
Author_Institution :
Dept. of Comput. Sci., Tsinghua Univ., Beijing, China
fYear :
2006
fDate :
25-29 April 2006
Abstract :
As an extension of Bayesian network, module network is an appropriate model for inferring causal network of a mass of variables from insufficient evidences. However learning such a model is still a time-consuming process. In this paper, we propose a parallel implementation of module network learning algorithm using OpenMP. We propose a static task partitioning strategy which distributes sub-search-spaces over worker threads to get the tradeoff between load-balance and software-cache-contention. To overcome performance penalties derived from shared-memory contention, we adopt several optimization techniques such as memory pre-allocation, memory alignment and static function usage. These optimizations have different patterns of influence on the sequential performance and the parallel speedup. Experiments validate the effectiveness of these optimizations. For a 2,200 nodes dataset, they enhance the parallel speedup up to 88%, together with a 2X sequential performance improvement. With resource contentions reduced, workload imbalance becomes the main hurdle to parallel scalability and the program behaviors more stable in various platforms.
Keywords :
belief networks; learning (artificial intelligence); multiprocessing systems; parallel processing; Bayesian network; OpenMP; load-balancing; memory alignment; memory preallocation; module network learning algorithm; module network structure learning; optimization; parallel speedup; performance tuning; sequential performance improvement; shared-memory contention; software-cache-contention; static function usage; static task partitioning strategy; workload imbalance; Bayesian methods; Bioinformatics; Computer science; Multiprocessing systems; Partitioning algorithms; Scalability; Speech processing; Stochastic processes; Text mining; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2006. IPDPS 2006. 20th International
Print_ISBN :
1-4244-0054-6
Type :
conf
DOI :
10.1109/IPDPS.2006.1639610
Filename :
1639610
Link To Document :
بازگشت