Title : 
The architectural costs of streaming I/O: A comparison of workstations, clusters, and SMPs
         
        
            Author : 
Arpaci-Dusseau, Remzi H. ; Arpaci-Dusseau, Andrea C. ; Culler, David E. ; Hellerstein, Joseph M. ; Patterson, David A.
         
        
            Author_Institution : 
Div. of Comput. Sci., California Univ., Berkeley, CA, USA
         
        
        
        
        
        
            Abstract : 
We investigate resource usage while performing streaming I/O by contrasting three architectures, a single workstation, a cluster, and an SMP, under various I/O benchmarks. We derive analytical and empirically-based models of resource usage during data transfer, examining the I/O bus, memory bus, network, and processor of each system. By investigating each resource in detail, we assess what comprises a well-balanced system for these workloads. We find that the architectures we study are not well balanced for streaming I/O applications. Across the platforms, the main limitation to attaining peak performance is the CPU, due to lack of data locality. Increasing processor performance (especially with improved block operation performance) will be of great aid for these workloads in the future. For a cluster workstation, the I/O bus is a major system bottleneck, because of the increased load placed on it from network communication. A well-balanced cluster workstation should have copious I/O bus bandwidth, perhaps via multiple I/O busses. The SMP suffers from poor memory-system performance; even when there is true parallelism in the benchmark, contention in the shared-memory system leads to reduced performance. As a result, the clustered workstations provide higher absolute performance for streaming I/O workloads
         
        
            Keywords : 
local area networks; parallel architectures; performance evaluation; shared memory systems; I/O bus; SMPs; architectural costs; cluster workstation; clusters; data locality; memory bus; peak performance; processor performance; resource usage; shared-memory system; streaming I/O; workstations; Bandwidth; Computer architecture; Computer science; Costs; Peak to average power ratio; Performance gain; Switched-mode power supply; System performance; Thumb; Workstations;
         
        
        
        
            Conference_Titel : 
High-Performance Computer Architecture, 1998. Proceedings., 1998 Fourth International Symposium on
         
        
            Conference_Location : 
Las Vegas, NV
         
        
            Print_ISBN : 
0-8186-8323-6
         
        
        
            DOI : 
10.1109/HPCA.1998.650549