DocumentCode :
3043629
Title :
Supercompilers, the AMD Opteron, and your cell phone
Author :
Wolfe, Michael
fYear :
2004
fDate :
26-30 April 2004
Firstpage :
98
Abstract :
Summary form only given. As The Portland Group Compiler Technology and the Advanced Compilers Tools group within STMicroelectronics, we deliver compilers for both the high performance technical computing market, such as Summary form only given. AMD Opteron servers, and embedded DSP microcores, in particular, the ST100 from STMicroelectronics. This article explores the similarities and differences between compiler optimization strategies for these two markets. We demonstrate that many of the optimizations developed for high performance systems apply in the embedded market. For instance, compilers for both markets use vectorization for multimedia-type operations, loop unrolling, and interprocedural analysis. Software pipelining, first developed for the technical computing market, is now used for the ST100. There are also important differences; multiprocessing is quite mature in the technical computing industry, but is only beginning to be explored in the embedded market. On the other hand, optimizing for performance is not the only goal in the embedded industry; the total memory usage is also quite important. This article shows case-by-case performance improvements on the AMD Opteron and ST122 DSP for a selected set of common optimizations.
Keywords :
digital signal processing chips; embedded systems; optimising compilers; AMD Opteron server; Advanced Compilers Tools group; ST100; ST122 DSP; STMicroelectronics; The Portland Group Compiler Technology; cell phone; compiler optimization; embedded DSP microcores; embedded market; interprocedural analysis; loop unrolling; multimedia-type operation; multiprocessing; software pipelining; supercompiler; technical computing market; vectorization; Application software; Cellular phones; Digital signal processing; Embedded computing; High performance computing; Optimizing compilers; Out of order; Supercomputers; Trademarks; VLIW;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN :
0-7695-2132-0
Type :
conf
DOI :
10.1109/IPDPS.2004.1303044
Filename :
1303044
Link To Document :
بازگشت