DocumentCode :
1220040
Title :
Continual flow pipelines: achieving resource-efficient latency tolerance
Author :
Srinivasan, Srikanth T. ; Rajwar, Ravi ; Akkary, Haitham ; Gandhi, Amit ; Upton, Michael
Volume :
24
Issue :
6
fYear :
2004
Firstpage :
62
Lastpage :
73
Abstract :
With the natural trend toward integration, microprocessors are increasingly supporting multiple cores on a single chip. To keep design effort and costs down, designers of these multicore microprocessors frequently target an entire product range, from mobile laptops to high-end servers. This article discusses a continual flow pipeline (CFP) processor. Such processor architecture can sustain a large number of in-flight instructions (commonly referred to as the instruction window and comprising all instructions renamed but not retired) without requiring the cycle-critical structures to scale up. By keeping these structures small and making the processor core tolerant of memory latencies, a CFP mechanism enables the new core to achieve high single-thread performance, and many of these new cores can be placed on a chip for high throughput. The resulting large instruction window reveals substantial instruction-level parallelism and achieves memory latency tolerance, while the small size of cycle-critical resources permits a high clock frequency
Keywords :
cache storage; checkpointing; instruction sets; parallel architectures; pipeline processing; cache storage; checkpointing; continual flow pipeline processor; in-flight instructions; instruction-level parallelism; memory latency tolerance; microprocessors; resource-efficient latency tolerance; Costs; Delay; Microprocessors; Multicore processing; Pipelines; Portable computers; Process design; Processor scheduling; Registers; Throughput;
fLanguage :
English
Journal_Title :
Micro, IEEE
Publisher :
ieee
ISSN :
0272-1732
Type :
jour
DOI :
10.1109/MM.2004.71
Filename :
1388159
Link To Document :
بازگشت