• DocumentCode
    3433497
  • Title

    New and improved word-based unified and scalable architecture for radix 2 Montgomery modular multiplication algorithm

  • Author

    Ibrahim, Amin ; Gebali, Fayez ; Elsimary, Hamed

  • Author_Institution
    Salman Bin AbdulAziz Univ., Alkharj, Saudi Arabia
  • fYear
    2013
  • fDate
    27-29 Aug. 2013
  • Firstpage
    153
  • Lastpage
    158
  • Abstract
    This paper presents a new and improved word-based processor array architecture for unified and scalable radix2 Montgomery modular multiplication algorithm. In this architecture, the multiplicand and the modulus words are allocated to each processing element rather than pipelined between the processing elements as in the previous architecture extracted by Ç. Koç̧, and also the multiplier bits are fed serially to the first processing element of the processor array every odd clock cycle. Moreover, this architecture was modified to reduce the critical path delay and area by replacing the two levels of carry save adder (CSA) logic by modified 4-to-2 CSA that use only one level of dual field adder logic (DFA) taking advantage of processing two operand words by the same processing element (PE) of the processor array. An ASIC Implementation of the proposed architecture shows that it can perform 1024-bit modular multiplication (for word size w = 32) in about 17.07 μs. Also, the results show that it has smaller Area ×Time values compared to all existing designs by ratios ranging from 11.6 % to 47.8 % which makes it suitable for implementations where both area and performance are of concern. Moreover, it has higher throughput (1.8-39.5 %) than most of the published unified and scalable architectures except the architecture extracted by Harris. It has slightly higher throughput (4.5 %) than the proposed one.
  • Keywords
    computer architecture; microprocessor chips; pipeline processing; CSA logic; DFA; Radix 2 montgomery modular multiplication algorithm; carry save adder; critical path delay; dual field adder logic; odd clock cycle; processing element; processing elements; processor array; scalable radix2 Montgomery modular multiplication algorithm; unified radix2 Montgomery modular multiplication algorithm; word based processor; word based unified and scalable architecture; Adders; Arrays; Clocks; Doped fiber amplifiers; Hardware; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications, Computers and Signal Processing (PACRIM), 2013 IEEE Pacific Rim Conference on
  • Conference_Location
    Victoria, BC
  • ISSN
    1555-5798
  • Type

    conf

  • DOI
    10.1109/PACRIM.2013.6625466
  • Filename
    6625466