DocumentCode
511767
Title
An instruction redundancy removal method on a transport triggered architecture processor
Author
Wang, Su ; Yao, Suying ; Guo, Wei ; Wei, Jizeng
Author_Institution
Sch. of Electron. & Inf. Eng., Tianjin Univ., Tianjin, China
fYear
2009
fDate
14-16 Dec. 2009
Firstpage
602
Lastpage
604
Abstract
As a class of configurable instruction-level parallelism architectures, transport triggered architecture (TTA) processors, with simple but high cost-effective architecture, have more advantage in the field of embedded digital signal processing applications comparing with the general purpose processors. In this paper, an instruction redundancy removal TTA processor with base plus offset addressing load/store function unit (LSFU) and no-operation (NOP) instruction slots reuse is introduced. With the new LSFU, a TTA processor´s addressing ability and flexibility were greatly improved. At the same time, program memory size was saved, because the number of long immediate data in the program was dramatically decreased. Furthermore, the novel NOP instruction slots reuse also shrunk the program memory size by utilizing the waste NOP instruction slots. To evaluate the instruction redundancy removal method, a 32-point DCT benchmark was used. Comparing with the results from a general purpose processor, ARM9E-S, and a traditional TTA processor published before, it shows significant efficiency improvement in our design.
Keywords
instruction sets; microprocessor chips; parallel architectures; DCT benchmark; base plus offset addressing load-store function unit; configurable instruction-level parallelism architectures; embedded digital signal processing; general purpose processors; instruction redundancy removal TTA processor; instruction redundancy removal method; no-operation instruction slots reuse; program memory size; transport triggered architecture processor; Application software; Computer architecture; Computer science; Decoding; Digital signal processing; Discrete cosine transforms; Encoding; Parallel processing; Registers; VLIW; 32-point DCT; base plus offset addressing; instruction compression; transport triggered architecture;
fLanguage
English
Publisher
ieee
Conference_Titel
Integrated Circuits, ISIC '09. Proceedings of the 2009 12th International Symposium on
Conference_Location
Singapore
Print_ISBN
978-9-8108-2468-6
Type
conf
Filename
5403730
Link To Document