DocumentCode :
2680966
Title :
Empirical Analysis of Multicore CPU and GPU-Based Parallel Solutions to Sustain Throughput Needed by Scalable Proxy Servers for Protected Videos
Author :
Gomes, Leandro A S ; Neves, Bruno S. ; Pinho, Leonardo B.
Author_Institution :
Eng. de Comput., Univ. Fed. do Pampa (UNIPAMPA), Bage, Brazil
fYear :
2012
fDate :
17-19 Oct. 2012
Firstpage :
49
Lastpage :
56
Abstract :
Proxy servers of scalable video distribution systems must perform not only efficient memory management but also adopt video protection mechanisms. This work proposes an adaptive mix-grained parallelization of AES ciphering algorithm to provide customized video segments to concurrent clients implemented using CUDA, Pthreads, and OpenMP in order to exploit multicore CPU or GPU. An evaluation is conducted using a server with Hyper-Threading (HT) capable multicore CPU and state-of-the-art 448-core GPU in contrast to a desktop using multicore CPU without HT and low-cost 128-core GPU. Overall, PCI-Express impacts CUDA´s achievable throughput and HT affects the amount of cores and cooperative threads needed by application for Pthreads and even more for OpenMP. As expected, CUDA reaches higher throughput but a comparison between GPUs regarding thread occupancy demonstrates that greater core availability doesn´t guarantee highest throughput, which will be demanded as network capacity of proxy servers is migrating from 1 to 10 Gbps.
Keywords :
client-server systems; cryptography; multi-threading; multiprocessing systems; parallel architectures; video coding; 448-core GPU; AES ciphering algorithm; CUDA; GPU-based parallel solution; OpenMP; PCI-Express; Pthreads; adaptive mix-grained parallelization; concurrent client; cooperative threads; core availability; customized video segments; empirical analysis; hyper-threading capable multicore CPU; memory management; network capacity; protected video; scalable proxy server; scalable video distribution system; thread occupancy; video protection mechanism; Graphics processing units; Instruction sets; Kernel; Multicore processing; Servers; Throughput; Videos; GPU application; parallel programming; proxy reencryption; video distribution;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Systems (WSCAD-SSC), 2012 13th Symposium on
Conference_Location :
Petropolis
Print_ISBN :
978-1-4673-4468-5
Type :
conf
DOI :
10.1109/WSCAD-SSC.2012.37
Filename :
6391763
Link To Document :
بازگشت