• DocumentCode
    511767
  • Title

    An instruction redundancy removal method on a transport triggered architecture processor

  • Author

    Wang, Su ; Yao, Suying ; Guo, Wei ; Wei, Jizeng

  • Author_Institution
    Sch. of Electron. & Inf. Eng., Tianjin Univ., Tianjin, China
  • fYear
    2009
  • fDate
    14-16 Dec. 2009
  • Firstpage
    602
  • Lastpage
    604
  • Abstract
    As a class of configurable instruction-level parallelism architectures, transport triggered architecture (TTA) processors, with simple but high cost-effective architecture, have more advantage in the field of embedded digital signal processing applications comparing with the general purpose processors. In this paper, an instruction redundancy removal TTA processor with base plus offset addressing load/store function unit (LSFU) and no-operation (NOP) instruction slots reuse is introduced. With the new LSFU, a TTA processor´s addressing ability and flexibility were greatly improved. At the same time, program memory size was saved, because the number of long immediate data in the program was dramatically decreased. Furthermore, the novel NOP instruction slots reuse also shrunk the program memory size by utilizing the waste NOP instruction slots. To evaluate the instruction redundancy removal method, a 32-point DCT benchmark was used. Comparing with the results from a general purpose processor, ARM9E-S, and a traditional TTA processor published before, it shows significant efficiency improvement in our design.
  • Keywords
    instruction sets; microprocessor chips; parallel architectures; DCT benchmark; base plus offset addressing load-store function unit; configurable instruction-level parallelism architectures; embedded digital signal processing; general purpose processors; instruction redundancy removal TTA processor; instruction redundancy removal method; no-operation instruction slots reuse; program memory size; transport triggered architecture processor; Application software; Computer architecture; Computer science; Decoding; Digital signal processing; Discrete cosine transforms; Encoding; Parallel processing; Registers; VLIW; 32-point DCT; base plus offset addressing; instruction compression; transport triggered architecture;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Integrated Circuits, ISIC '09. Proceedings of the 2009 12th International Symposium on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-9-8108-2468-6
  • Type

    conf

  • Filename
    5403730