Title :
Performance optimization of vision apps on mobile application processor
Author :
Kwang-Ting Cheng ; Xin Yang ; Yi-Chu Wang
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, Santa Barbara, Santa Barbara, CA, USA
Abstract :
Optimizing performance of compute-intensive vision apps running on mobile application processor (AP) is critical to satisfactory experience for smartphone and tablet users. Most existing vision algorithms were primarily designed and implemented for desktop and server platforms. Porting them to a mobile platform without adapting the algorithms to account for the platform´s limitations would cause serious algorithmhardware mismatches, yielding unnecessary runtime degradation. Modern mobile AP, which integrates multicore CPUs, GPUs and other special-purpose accelerators, offers multiple options of porting vision apps to various computing cores. To develop an optimized implementation for a vision app, it is necessary to understand the potential mismatches for better algorithm adaptation and optimized mapping of the algorithm to a handheld platform. In this paper, we identify mismatches and propose adaptation guidelines for three different porting strategies: porting an algorithm to 1) a mobile CPU, 2) a mobile GPU, and 3) mobile heterogeneous multicores (mobile CPUs+GPUs). For each strategy, we illustrate the adaptation/porting guidelines using an exemplar vision task. Experimental results demonstrate that with proper adaptation following the proposed guidelines, we could achieve a significant speedup with little accuracy drop.
Keywords :
computer vision; graphics processing units; mobile computing; multiprocessing systems; performance evaluation; program processors; smart phones; algorithm-hardware mismatches; compute-intensive vision app performance optimization; computing cores; desktop platforms; handheld platform; mobile AP; mobile CPU; mobile GPU; mobile application processor; mobile heterogeneous multicores; multicore CPU; runtime degradation; server platforms; smartphone users; special-purpose accelerators; tablet users; vision algorithms; Acceleration; Face; Face recognition; Feature extraction; Graphics processing units; Mobile communication; Runtime; GPU acceleration; algorithm adaption; heterogeneous parallel computing; mobile application processor;
Conference_Titel :
Systems, Signals and Image Processing (IWSSIP), 2013 20th International Conference on
Conference_Location :
Bucharest
Print_ISBN :
978-1-4799-0941-4
DOI :
10.1109/IWSSIP.2013.6623485