DocumentCode
1628730
Title
A dedicated adaptive loop pre-fetch mechanism for stream-like application
Author
Huang, Xiao-Ping ; Fan, Xiao-Ya ; Chen, Yu-Hui ; He, Xiang-Dong
Author_Institution
Comput. Sch., Northwestern Polytech. Univ., Xian, China
fYear
2010
Firstpage
575
Lastpage
577
Abstract
For the stream-like applications with high-bandwidth and low latency, optimizing the memory latency can effectively improve the QoS. In this paper, we propose a dedicated adaptive loop pre-fetch mechanism to reduce the memory latency and also improve the pre-fetch accuracy. In the mechanism, when a loop sequences is detected, the stream pre-fetch engine can adaptively initiate the pre-fetch operation and store the return data into the on-chip stream buffers. The pre-fetch engine consists of loop sequences recognition, stream buffer FIFOs, address calculation ALU. A hardware engine is implemented and integrated into a processor to verify the mechanism. When the processor with the pre-fetch engine is running a regular loop sequences, it can save 2/3 to 1/2 of the time spent on memory latency. Also the mechanism can alleviate the cache pollution and the cache thrash.
Keywords
microprocessor chips; quality of service; semiconductor storage; video streaming; QoS; adaptive loop prefetch mechanism; address calculation ALU; cache thrash; loop sequences recognition; memory latency; prefetch engine; processor; stream buffer FIFO; stream like application; Buffer storage; Data mining; Engines; Hardware; Pipelines; Pollution; Radiation detectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Solid-State and Integrated Circuit Technology (ICSICT), 2010 10th IEEE International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-5797-7
Type
conf
DOI
10.1109/ICSICT.2010.5667325
Filename
5667325
Link To Document