Title :
On the Efficiency of Register File versus Broadcast Interconnect for Collective Communications in Data-Parallel Hardware Accelerators
Author :
Pedram, Ardavan ; Gerstlauer, Andreas ; Van de Geijn, Robert A.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Texas at Austin, Austin, TX, USA
Abstract :
Reducing power consumption and increasing efficiency are key concerns for many applications. How to design highly efficient computing elements while maintaining enough flexibility within a domain of applications is a fundamental question. In this paper, we present how broadcast buses can eliminate the use of power-hungry multi-ported register files in the context of data-parallel hardware accelerators for linear algebra operations. We demonstrate an algorithm/architecture co-design for the mapping of different collective communication operations, which are crucial for achieving performance and efficiency in most linear algebra routines such as GEMM, SYRK, and matrix transposition. We compare a broadcast-bus-based architecture with conventional SIMD, 2D-SIMD, and flat register file architectures for these operations in terms of area and energy efficiency. Results show that fast broadcast data movement abilities in a prototypical linear algebra core can achieve up to 75× better power efficiency and up to 10× better area efficiency compared to traditional SIMD architectures.
Keywords :
data handling; file organisation; parallel processing; power aware computing; SIMD; broadcast buses; broadcast interconnect; collective communications; data parallel hardware accelerators; energy efficiency; linear algebra; matrix transposition; power consumption; register file; Arrays; Hardware; Registers; Symmetric matrices; Vectors; Broadcast bus; High performance computing; Matrix Multiply; Power efficiency; Register-file;
Conference_Title :
Computer Architecture and High Performance Computing (SBAC-PAD), 2012 IEEE 24th International Symposium on
Conference_Location :
New York, NY
Print_ISBN :
978-1-4673-4790-7
DOI :
10.1109/SBAC-PAD.2012.35