Title :
Aggressive dynamic execution of multimedia kernel traces
Author :
Bishop, Benjamin ; Owens, Robert ; Irwin, Mary Jane
Author_Institution :
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
fDate :
30 Mar-3 Apr 1998
Abstract :
There has been relatively little analytical work on processor optimizations for multimedia applications. With the introduction of MMX by Intel, it is clear that this is an area of increasing importance. Building on previous work, the authors propose optimizations for multimedia architectures that support independent parallel execution of instructions within dynamically assembled traces, resulting in dramatic performance improvements. Specifically they propose simplified instruction scheduling and register renaming algorithms due to constraints on trace formation. In addition, they suggest specific instruction pool and trace cache parameters. They constructed a simulator in order to measure the benefits of these processor optimizations for multimedia applications. The simulated machine, which could fetch/decode 2 instructions per cycle, performed better than a superscalar machine that could fetch/decode 8 instructions per cycle. Execution rates as high as 7.3 instructions per cycle were achieved for the benchmarks simulated, assuming 16 instructions per trace
Keywords :
cache storage; multimedia computing; optimisation; parallel architectures; processor scheduling; virtual machines; Intel MMX; aggressive dynamic execution; dynamically assembled traces; execution rates; independent parallel instruction execution; instruction decoding; instruction fetching; instruction pool parameters; instruction scheduling; multimedia applications; multimedia architectures; multimedia kernel traces; performance improvements; processor optimization; register renaming algorithm; simulator; trace cache parameters; trace formation constraints; Application software; Assembly; Buildings; Computer science; Decoding; Dynamic scheduling; Kernel; Parallel processing; Processor scheduling; Scheduling algorithm;
Conference_Titel :
Parallel Processing Symposium, 1998. IPPS/SPDP 1998. Proceedings of the First Merged International ... and Symposium on Parallel and Distributed Processing 1998
Conference_Location :
Orlando, FL
Print_ISBN :
0-8186-8404-6
DOI :
10.1109/IPPS.1998.669994