Title : 
Scatter-add in data parallel architectures
         
        
            Author : 
Ahn, Jung Ho ; Erez, Mattan ; Dally, William J.
         
        
            Author_Institution : 
Comput. Syst. Lab., Stanford Univ., CA, USA
         
        
        
        
        
        
            Abstract : 
Many important applications exhibit large amounts of data parallelism, and modern computer systems are designed to take advantage of it. While much of the computation in the multimedia and scientific application domains is data parallel, certain operations require costly serialization that increase the run time. Examples include superposition type updates in scientific computing and histogram computations in media processing. We introduce scatter-add, which is the data-parallel form of the well-known scalar fetch-and-op, specifically tuned for SIMD/vector/stream style memory systems. The scatter-add mechanism scatters a set of data values to a set of memory addresses and adds each data value to each referenced memory location instead of overwriting it. This novel architecture extension allows us to efficiently support data-parallel atomic update computations found in parallel programming languages such as HPF, and applies both to single-processor and multiprocessor SIMD data-parallel systems. We detail the microarchitecture of a scatter-add implementation on a stream architecture, which requires less than 2% increase in die area yet shows performance speedups ranging from 1.45 to over 11 on a set of applications that require a scatter-add computation.
         
        
            Keywords : 
parallel architectures; parallel programming; storage allocation; SIMD style memory system; computer systems; data parallel architectures; data-parallel atomic update computations; memory addresses; multimedia application; multiprocessor SIMD data-parallel system; parallel programming languages; referenced memory location; scatter-add microarchitecture; scientific application; single-processor SIMD data-parallel system; stream architecture; stream style memory system; vector style memory system; Application software; Computer architecture; Concurrent computing; Histograms; Parallel architectures; Parallel processing; Parallel programming; Quantum computing; Scattering; Scientific computing;
         
        
        
        
            Conference_Titel : 
High-Performance Computer Architecture, 2005. HPCA-11. 11th International Symposium on
         
        
        
            Print_ISBN : 
0-7695-2275-0
         
        
        
            DOI : 
10.1109/HPCA.2005.30