Title of article :
Optimizing a Parallel Video Encoder with Message Passing and a Shared Memory Architecture
Author/Authors :
GU, Junli Tsinghua University - Institute of Microelectronics, China , SUN, Yihe Tsinghua University - Institute of Microelectronics, China
From page :
393
To page :
398
Abstract :
Implementing video applications on emerging multi-core processors is a promising technique for personal, real-time multi-media applications. However, when porting the legacy parallel video encoders developed for clusters to shared-memory multi-cores, the existing parallel algorithms result in workload imbalances on different cores and communication inefficiencies. This paper describes a strip-wise parallel scheme to balance workloads and a hybrid communication mechanism to reduce communication overhead. The implementation of the H.264 parallel encoder on an eight CPU Intel Xeon system achieves 5x to 6x speed-up over a single thread encoder and achieves a 29% performance improvement over the commonly used master-slave schemes on clusters. The paper also gives further analysis on scalability, parallel efficiency, workload balance, and communication overhead as the number of cores varies.
Keywords :
parallel video encoder , speed improvement , message passing , shared memory
Journal title :
Tsinghua Science and Technology
Journal title :
Tsinghua Science and Technology
Record number :
2535396
Link To Document :
بازگشت