DocumentCode :
511767
Title :
An instruction redundancy removal method on a transport triggered architecture processor
Author :
Wang, Su ; Yao, Suying ; Guo, Wei ; Wei, Jizeng
Author_Institution :
Sch. of Electron. & Inf. Eng., Tianjin Univ., Tianjin, China
fYear :
2009
fDate :
14-16 Dec. 2009
Firstpage :
602
Lastpage :
604
Abstract :
As a class of configurable instruction-level parallelism architectures, transport triggered architecture (TTA) processors, with simple but high cost-effective architecture, have more advantage in the field of embedded digital signal processing applications comparing with the general purpose processors. In this paper, an instruction redundancy removal TTA processor with base plus offset addressing load/store function unit (LSFU) and no-operation (NOP) instruction slots reuse is introduced. With the new LSFU, a TTA processor´s addressing ability and flexibility were greatly improved. At the same time, program memory size was saved, because the number of long immediate data in the program was dramatically decreased. Furthermore, the novel NOP instruction slots reuse also shrunk the program memory size by utilizing the waste NOP instruction slots. To evaluate the instruction redundancy removal method, a 32-point DCT benchmark was used. Comparing with the results from a general purpose processor, ARM9E-S, and a traditional TTA processor published before, it shows significant efficiency improvement in our design.
Keywords :
instruction sets; microprocessor chips; parallel architectures; DCT benchmark; base plus offset addressing load-store function unit; configurable instruction-level parallelism architectures; embedded digital signal processing; general purpose processors; instruction redundancy removal TTA processor; instruction redundancy removal method; no-operation instruction slots reuse; program memory size; transport triggered architecture processor; Application software; Computer architecture; Computer science; Decoding; Digital signal processing; Discrete cosine transforms; Encoding; Parallel processing; Registers; VLIW; 32-point DCT; base plus offset addressing; instruction compression; transport triggered architecture;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Integrated Circuits, ISIC '09. Proceedings of the 2009 12th International Symposium on
Conference_Location :
Singapore
Print_ISBN :
978-9-8108-2468-6
Type :
conf
Filename :
5403730
Link To Document :
بازگشت