Title :
Profile-guided application partitioning for heterogeneous reconfigurable platforms
Author :
Ostadzadeh, S. Arash ; Meeuws, Roel ; Ashraf, Imran ; Galuzzi, Carlo ; Bertels, Koen
Author_Institution :
Comput. Eng. Group, Delft Univ. of Technol., Delft, Netherlands
Abstract :
The increased complexity of programming heterogeneous reconfigurable platforms requires a thorough understanding of application behavior, for which developers need sophisticated analysis tools. One particular problem, which severely limits the performance gain of running applications on these platforms, is the inappropriateness of the kernels mapped onto the reconfigurable fabrics. Efficient porting of legacy applications to these emerging heterogeneous platforms demands code tuning considering several critical points, such as, proper kernel size and small memory communication overhead. Detailed profiling information is thus vital for an efficient HW/SW co-design. To facilitate addressing these issues, we developed the Q2 profiling framework. It consists of two parts: an advanced memory access profiling toolset that provides detailed information on the run-time memory access patterns of an application and a statistical modeling framework that makes predictions for resources, early in the design phase, based on software metrics. The code optimizations triggered by careful analysis of the profiling information is used to tailor existing applications for heterogeneous reconfigurable platforms. In this paper, we examine a real application in detail to show the potential of the proposed profiling framework. Experimental results show that a speedup of 1.3× is achieved by accelerating a merged kernel of four critical functions in the application.
Keywords :
hardware-software codesign; optimising compilers; program diagnostics; reconfigurable architectures; software maintenance; software metrics; software portability; statistical analysis; HW/SW codesign; Q2 profiling framework; application performance gain; code optimizations; code tuning; critical functions; heterogeneous reconfigurable platform programming complexity; kernel mapping; kernel size; legacy application porting; memory access profiling toolset; memory communication overhead; profile-guided application partitioning; reconfigurable fabrics; resource predictions; run-time memory access patterns; software metrics; statistical modeling framework; Computer architecture; Field programmable gate arrays; Hardware; Kernel; Merging; Speech;
Conference_Titel :
Computer Architecture and Digital Systems (CADS), 2012 16th CSI International Symposium on
Conference_Location :
Shiraz, Fars
Print_ISBN :
978-1-4673-1481-7
DOI :
10.1109/CADS.2012.6316416