• DocumentCode
    3208774
  • Title

    An integrated memory array processor architecture for embedded image recognition systems

  • Author

    Kyo, Shorin ; Okazaki, Shin´ichiro ; Arai, Tamio

  • Author_Institution
    Media & Inf. Res. Labs., NEC Corp., Japan
  • fYear
    2005
  • fDate
    4-8 June 2005
  • Firstpage
    134
  • Lastpage
    145
  • Abstract
    Embedded processors for video image recognition require to address both the cost (die size and power) versus real-time performance issue, and also to achieve high flexibility due to the immense diversity of recognition targets, situations, and applications. This paper describes IMAP, a highly parallel SIMD linear processor and memory array architecture that addresses these trading-off requirements. By using parallel and systolic algorithmic techniques, despite of its simple architecture IMAP achieves to exploit not only the straightforward per image row data level parallelism (DLP), but also the inherent DLP of other memory access patterns frequently found in various image recognition tasks, under the use of an explicit parallel C language (IDC). We describe and evaluate IMAP-CE, a latest IMAP processor, which integrates 128 of 100MHz 8 bit 4-way VLIW PEs, 128 of 2KByte RAMs, and one 16 bit RISC control processor, into a single chip. The PE instruction set is enhanced for supporting IDC codes. IMAP-CE is evaluated mainly by comparing its performance running IDC codes with that of a 2.4GHz Intel P4 running optimized C codes. Based on the use of parallelizing techniques, benchmark results show a speedup of up to 20 for image filter kernels, and of 4 for a full image recognition application.
  • Keywords
    C language; image recognition; memory architecture; microprocessor chips; parallel architectures; parallelising compilers; program processors; random-access storage; reduced instruction set computing; IDC code; IMAP processor; IMAP-CE; PE instruction set; RAM; RISC control processor; SIMD linear processor; VLIW PE; data level parallelism; embedded image recognition system; image filter kernel; memory array processor architecture; parallel C language; parallel algorithmic technique; systolic algorithmic technique; video image recognition; Costs; Filters; Image recognition; Kernel; Memory architecture; Process control; Random access memory; Reduced instruction set computing; Target recognition; VLIW;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architecture, 2005. ISCA '05. Proceedings. 32nd International Symposium on
  • ISSN
    1063-6897
  • Print_ISBN
    0-7695-2270-X
  • Type

    conf

  • DOI
    10.1109/ISCA.2005.11
  • Filename
    1431552