• DocumentCode
    85376
  • Title

    A 2.05 GVertices/s 151 mW Lighting Accelerator for 3D Graphics Vertex and Pixel Shading in 32 nm CMOS

  • Author

    Sheikh, Farhana ; Mathew, Sanu K. ; Anders, Mark A. ; Kaul, Himanshu ; Hsu, S.K. ; Agarwal, Abhishek ; Krishnamurthy, Ram K. ; Borkar, Shekhar

  • Author_Institution
    Circuit Res. Lab., Intel Corp., Hillsboro, OR, USA
  • Volume
    48
  • Issue
    1
  • fYear
    2013
  • fDate
    Jan. 2013
  • Firstpage
    128
  • Lastpage
    139
  • Abstract
    This paper describes a single-cycle throughput lighting accelerator fabricated in 1.05 V, 32 nm CMOS for on-die acceleration of 3D graphics vertex and pixel shading in high-performance processors and mobile systems-on-chip. Log-domain parallel computation of ambient, diffuse, and specular lighting using high-accuracy 32b log and anti-log units that convert computation from floating-point (FP) to fixed-point domain, 32b sparse-tree fixed-point adders and a 32 × 32b signed fixed-point multiplier with truncated partial product reduction tree enable 2.05 GVertices/s throughput measured at 1.05 V, 25°C in an area of 0.064 mm2 while achieving: (i) 47% reduction in critical path logic stages compared to previously published work, (ii) 0.56% mean vertex lighting error compared to single-precision FP computation, (iii) 354 μW active leakage power measured at 1.05 V, 25 °C, (iv) scalable performance up to 2.22 GHz, 232 mW measured at 1.2 V, (v) peak energy-efficiency of 56 GVertices/s/W, measured at 560 mV, 25 °C, and (vi) 119.6 dB PSNR for a 2 M pixel high-resolution 3D image.
  • Keywords
    CMOS integrated circuits; adders; image resolution; parallel processing; system-on-chip; trees (mathematics); 3D graphics vertex; CMOS; PSNR; active leakage power; antilog unit; critical path logic; fixed-point domain; floating-point; frequency 2.22 GHz; high-accuracy log unit; high-performance processor; log-domain parallel computation; mean vertex lighting error; mobile systems-on-chip; noise figure 119.6 dB; on-die acceleration; peak energy-efficiency; pixel high-resolution 3D image; pixel shading; power 151 mW; power 232 mW; power 354 muW; signed fixed-point multiplier; single-cycle throughput lighting accelerator; single-precision FP computation; size 32 nm; sparse-tree fixed-point adder; specular lighting; temperature 25 C; truncated partial product reduction tree; voltage 1.05 V; voltage 1.2 V; voltage 560 V; Acceleration; Accuracy; Approximation methods; Equations; Graphics; Lighting; Mathematical model; 3D graphics; Phong illumination; fixed-point; lighting acceleration; log arithmetic; vertex and pixel shading;
  • fLanguage
    English
  • Journal_Title
    Solid-State Circuits, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    0018-9200
  • Type

    jour

  • DOI
    10.1109/JSSC.2012.2222813
  • Filename
    6374707