Author_Institution :
IBM Almaden Res. Center, San Jose, CA, USA
Abstract :
Programming techniques necessary for high performance on the 3090 Vector Facilities are illustrated, showing that VS Fortran programs can achieve near-maximum execution rates. Relevant features of the 3090 architecture are reviewed, stressing the need to make efficient use of a hierarchical storage system and to take advantage of the compound vector instructions. The key programming techniques for managing the storage hierarchy are loop sectioning, loop distribution, and data compaction. Vector register reuse, cache reuse, and, for virtual memory, storage format and page reuse are shown to lead to efficient use of the vector registers, the high-speed cache, and the virtual memory system, respectively. The multiply-and-add compound instruction is discussed.
Keywords :
IBM computers; parallel architectures; parallel machines; parallel programming; storage allocation; 3090 architecture; IBM 3090; VS Fortran; cache reuse; data compaction; hierarchical storage system; high speed cache; loop distribution; loop sectioning; maximum vector performance; multiply-and-add compound instruction; page reuse; programming techniques; storage format; storage hierarchy; virtual memory; Algorithms; Assembly; Computer architecture; Libraries; Memory management; Power engineering and energy; Power engineering computing; Program processors; Programming profession; Registers;