Title :
Optimizing overall loop schedules using prefetching and partitioning
Author :
Chen, Fei ; O'Neil, Timothy W. ; Sha, Edwin H.-M.
Author_Institution :
Dept. of Comput. Sci., Notre Dame Univ., IN, USA
Date :
6/1/2000
Abstract :
In this paper, a method combining the loop pipelining technique with data prefetching, called Partition Scheduling with Prefetching (PSP), is proposed. In PSP, the iteration space is first divided into regular partitions. Then a two-part schedule, consisting of an ALU part and a memory part, is produced and balanced to achieve high throughput. The two parts are executed simultaneously, so the remote memory latencies are overlapped with computation. We study the optimal partition shape and size so that a well-balanced overall schedule can be obtained. Experiments on DSP benchmarks show that the proposed methodology consistently produces optimal or near-optimal solutions.
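The balancing idea in the abstract can be illustrated with a toy cost model. The sketch below is not the PSP algorithm from the paper; it assumes a 2-D iteration space cut into rectangular `w x h` partitions, an ALU part whose length grows with the partition area, and a memory part whose length grows with the partition boundary. The function names, the boundary-based prefetch count, and the cycle parameters are all hypothetical and chosen only for illustration.

```python
from itertools import product


def partition_schedule(w, h, alu_cycles_per_iter, prefetch_cycles):
    """Toy cost model for one w x h partition (an assumption, not the paper's model)."""
    # ALU part: every iteration inside the partition is computed.
    alu_part = w * h * alu_cycles_per_iter
    # Memory part: assume one prefetch per boundary element along two edges.
    mem_part = (w + h) * prefetch_cycles
    # The two parts run simultaneously, so the longer one sets the length.
    length = max(alu_part, mem_part)
    return alu_part, mem_part, length


def smallest_balanced_partition(max_side, alu_cycles_per_iter, prefetch_cycles):
    """Return the smallest-area (w, h) whose memory part is hidden by the ALU part."""
    best = None
    for w, h in product(range(1, max_side + 1), repeat=2):
        alu_part, mem_part, _ = partition_schedule(
            w, h, alu_cycles_per_iter, prefetch_cycles
        )
        # Balanced here means the prefetch latency is fully overlapped.
        if mem_part <= alu_part:
            if best is None or w * h < best[0] * best[1]:
                best = (w, h)
    return best  # None if no partition within max_side is balanced
```

Under these assumptions, calling `smallest_balanced_partition(12, 2, 10)` searches partitions up to 12 x 12 and returns the smallest one whose prefetches finish no later than its computation. Preferring the smallest area is a design choice of this sketch, motivated by the fact that larger partitions would need more local memory.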
Keywords :
storage management; data prefetching; loop pipelining; optimal partition shape; overall loop schedules; partitioning; prefetching; remote memory latencies; Delay; Digital signal processing; Hardware; Partitioning algorithms; Pipeline processing; Prefetching; Processor scheduling; Scheduling algorithm; Shape; Throughput
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on