Title :
Challenges in exploitation of loop parallelism in embedded applications
Author :
Kejariwal, A. ; Veidenbaum, A.V. ; Nicolau, A. ; Girkar, M. ; Xinmin Tian
Author_Institution :
Univ. of California, Irvine
Abstract :
Embedded processors have been increasingly exploiting hardware parallelism. Vector units, multiple processors or cores, hyper-threading, special-purpose accelerators such as DSPs or cryptographic engines, or a combination of the above have appeared in a number of processors. They serve to address the increasing performance requirements of modern embedded applications. How this hardware parallelism can be exploited by applications is directly related to the amount of parallelism inherent in a target application. In this paper we evaluate the performance potential of different types of parallelism, viz., true thread-level parallelism, speculative thread- level parallelism and vector parallelism, when executing loops. Applications from the industry-standard EEMBC 1.1, EEMBC 2.0 and the MiBench embedded benchmark suites are analyzed using the Intel C compiler. The results show what can be achieved today, provide upper bounds on the performance potential of different types of thread parallelism, and point out a number of issues that need to be addressed to improve performance. The latter include parallelization of libraries such as libc and design of parallel algorithms to allow maximal exploitation of parallelism. The results also point to the need for developing new benchmark suites more suitable to parallel compilation and execution.
Keywords :
embedded systems; multi-threading; multiprocessing systems; parallel processing; parallelising compilers; EEMBC 1.1industry-standard; EEMBC 2.0 industry-standard; Intel C compiler; MiBench embedded benchmark suite; embedded processor; hardware loop parallelism; multiple processor; thread-level parallelism; Cryptography; Engines; Hardware; Multithreading; Parallel processing; Parallel programming; Permission; Software engineering; Software libraries; Software measurement; libraries; multi-cores; multithreading; parallel loops; programming models; thread-level speculation; vectorization;
Conference_Titel :
Hardware/Software Codesign and System Synthesis, 2006. CODES+ISSS '06. Proceedings of the 4th International Conference
Conference_Location :
Seoul
Print_ISBN :
1-59593-370-0
Electronic_ISBN :
1-59593-370-0
DOI :
10.1145/1176254.1176298