DocumentCode
2954111
Title
Broadcast with mask on a massively parallel processing on a chip
Author
Krichene, Hana ; Baklouti, Mouna ; Abid, Mohamed ; Marquet, Philippe ; Dekeyser, Jean Luc
Author_Institution
LIFL, Univ. of Lille 1, Lille, France
fYear
2012
fDate
2-6 July 2012
Firstpage
275
Lastpage
280
Abstract
The delay of instructions broadcast has a significant impact on the performance of Single Instruction Multiple Data (SIMD) architecture. This is especially true for massively parallel processing Systems-on-Chip (mppSoC), where the processing stage and that of setting up the communication mechanism need several clock periods. Subnetting is the strategy used to partition a single physical network into more than one smaller logical sub-networks (subnets). This technique better controls the broadcast instructions domain and the data traffic between network nodes. Furthermore, it allows to separate synchronous communications from asynchronous processing which maintains reliable communications and rapid processing through parallel processors. This paper describes the design of a communication model called broadcast with mask. This model is dedicated to mppSoC architecture with a huge number of processor elements because it maintains performances even when the number of processors increases. Simulation results and an FPGA implementation validate our approach.
Keywords
broadcast communication; clocks; field programmable gate arrays; parallel architectures; system-on-chip; telecommunication network reliability; telecommunication traffic; FPGA; SIMD architecture; asynchronous processing; broadcast instruction; broadcast with mask; clock period; communication mechanism; communication model; data traffic; logical subnetwork; mppSoC architecture; network node; parallel processing systems-on-chip; parallel processor; physical network; processing stage; processor element; reliable communication; single instruction multiple data architecture; subnetting; synchronous communication; Clocks; Computer architecture; Field programmable gate arrays; Parallel processing; Process control; Program processors; Registers;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing and Simulation (HPCS), 2012 International Conference on
Conference_Location
Madrid
Print_ISBN
978-1-4673-2359-8
Type
conf
DOI
10.1109/HPCSim.2012.6266924
Filename
6266924
Link To Document