مرکز منطقه ای اطلاع رساني علوم و فناوري - Memory-Aware Loop Paralleling for Coarse-Grained Reconfigurable Architectures

DocumentCode :

2725227

Title :

Memory-Aware Loop Paralleling for Coarse-Grained Reconfigurable Architectures

Author :

Yang, Ziyu ; Zhao, Peng ; Wang, Dawei ; Li, Sikun

Author_Institution :

Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China

fYear :

2012

fDate :

11-13 Aug. 2012

Firstpage :

2223

Lastpage :

2226

Abstract :

The parallelization of sequential programs and the optimization of critical loops are challenging issues in the time of multi-core architectures. Coarse-Grained Reconfigurable Architecture (CGRA) is introduced to accelerate these data-intensive applications, while the access delay introduced by the massive memory accesses contained in those loops has become the bottleneck of CGRA´s performance. In this paper we focus on the parallel optimization of critical loops under the CGRA´s hardware constraints. At first we propose a novel approach to parallelize loops by multi-level tilling. Then a genetic algorithm is introduced to schedule tiled loops with memory-aware object functions. Data locality and communication cost are optimized during the parallel processing as well. Experimental results show that our approach can generate more effective parallel tasks to improve the data locality and load-balanced execution, while obtains better speedup compared with the memory-unaware parallel processing.

Keywords :

genetic algorithms; memory architecture; parallel architectures; parallel programming; performance evaluation; processor scheduling; reconfigurable architectures; resource allocation; CGRA hardware constraints; CGRA performance; access delay; coarse-grained reconfigurable architectures; communication cost optimization; critical loop optimization; critical loop parallel optimization; data locality improvement; data locality optimization; data-intensive applications; genetic algorithm; load-balanced execution improvement; memory-aware loop parallelization; memory-aware object functions; multicore architectures; multilevel tiled loop scheduling; parallel processing; sequential program parallelization; Arrays; Computational modeling; Memory management; Parallel processing; Reconfigurable architectures; Tiles; Vectors; coarse-grained reconfigurable architecture; loop parallelization; multi-level tilling; polyhedral model;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Science & Service System (CSSS), 2012 International Conference on

Conference_Location :

Nanjing

Print_ISBN :

978-1-4673-0721-5

Type :

conf

DOI :

10.1109/CSSS.2012.552

Filename :

6394870

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2725227