Abstract :
The parallelization of the H.261 video coding algorithm on the IBM SP2(R) multiprocessor system is described. The effect of parallelizing computations and communications in the spatial, temporal, and both spatial-temporal domains are considered through the study of frame rate, speedup, and implementation efficiency, which are modeled and measured with respect to the number of nodes (n) and parallel methods used. Four parallel algorithms were developed, of which the first two exploited the spatial parallelism in each frame, and the last two exploited both the temporal and spatial parallelism over a sequence of frames. The two spatial algorithms differ in that one utilizes a single communication master, while the other attempts to distribute communications across three masters. On the other hand, the spatial-temporal algorithms use a pipeline structure for exploiting the temporal parallelism together with either a single master or multiple masters. The best median speedup (frame rate) achieved was close to 15 [15 frames per second (fps)] for 352×240 video on 24 nodes, and 13 (37 fps) for QCIF video, by the spatial algorithm with distributed communications. For n<10, the single-master spatial algorithm performs better with efficiency up to 90%, while the multiple-master spatial algorithm is superior for n>10, with efficiency up to 70%. The spatial-temporal algorithms achieved average speedup performance, but are most scalable for large n
Keywords :
code standards; data compression; multiprocessing systems; parallel algorithms; telecommunication standards; video coding; H.261 video coding algorithm; IBM SP2 multiprocessor system; QCIF video; average speedup performance; distributed communications; efficiency; frame rate; frames sequence; implementation efficiency; median speedup; multiple-master spatial algorithm; parallel algorithms; pipeline structure; single-master spatial algorithm; spatial data parallelization; spatial-temporal algorithms; temporal data parallelization; temporal parallelism; Concurrent computing; Image storage; Motion estimation; Multiprocessing systems; Parallel algorithms; Parallel processing; Pipelines; Velocity measurement; Video coding; Video sequences;