Title :
Accelerating and Adapting Precomputation Threads for Efficient Prefetching
Author :
Zhang, Weifeng ; Tullsen, Dean M. ; Calder, Brad
Author_Institution :
Dept. of Comput. Sci. & Eng., California Univ., San Diego, CA
Abstract :
Speculative precomputation enables effective cache prefetching for even irregular memory access behavior, by using an alternate thread on a multithreaded or multi-core architecture. This paper describes a system that constructs and runs precomputation-based prefetching threads via event-driven dynamic optimization. Precomputation threads are dynamically constructed by a runtime compiler from the program's frequently executed hot traces, and are adapted to the memory behavior automatically. Both construction and execution of the prefetching threads happen in another thread, imposing little overhead on the main thread. This paper also presents several techniques to accelerate the precomputation threads, including colocation of p-threads with hot traces, dynamic stride prediction, and automatic adaptation of runahead and jumpstart distance. The adaptive prefetching achieves 42% speedup, a 17% improvement over existing p-thread prefetching schemes.
Keywords :
multi-threading; program compilers; storage management; adaptive prefetching; dynamic stride prediction; event-driven dynamic optimization; memory behavior; p-thread prefetching; precomputation threads; prefetching threads; runtime compiler; Acceleration; Aerodynamics; Computer architecture; Computer science; Delay; Hardware; Monitoring; Prefetching; Runtime; Yarn;
Conference_Title :
High Performance Computer Architecture, 2007. HPCA 2007. IEEE 13th International Symposium on
Conference_Location :
Scottsdale, AZ
Print_ISBN :
1-4244-0805-9
Electronic_ISBN :
1-4244-0805-9
DOI :
10.1109/HPCA.2007.346187