• DocumentCode
    379065
  • Title
    Model oriented profiling of parallel programs
  • Author
    Gonzalez, Jose A. ; Leon, Coromoto ; Roda, J.L. ; Printista, M.
  • Author_Institution
    Dpto. Estadistica, I.O. y Computacion, Univ. de La Laguna, Spain
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    39
  • Lastpage
    45
  • Abstract
    The prediction analysis model presented extends BSP to cover both oblivious synchronization and group partitioning. These generalizations imply that different processors may finish the same superstep at different times. The other consideration is that, even if the numbers of individual communication or computation operations in two stages are the same, the actual times for these two stages may differ. These differences are due to the separate nature of the operations or to the particular pattern followed by the messages. Even worse, the assumption that a constant number of machine instructions takes constant time is far from the truth. Current memory hierarchies imply that memory access times vary from a few cycles to several thousand. A natural proposal is to associate a different proportionality constant with each basic block and, analogously, to associate different latencies and bandwidths with each "communication block". Unfortunately, using this approach implies that the evaluation parameters not only depend on the given architecture, but also reflect algorithm characteristics. Such parameter evaluation must be done for every algorithm. This is a heavy task, involving experiment design, timing, statistics, pattern recognition and multi-parameter fitting algorithms. Software support is required. We have developed a compiler that takes as source a C program annotated with complexity formulas and produces as output an instrumented code. The trace files obtained from the execution of the resulting code are analyzed with an interactive interpreter, giving us, among other information, the values of those parameters.
  • Keywords
    parallel programming; program compilers; program diagnostics; program interpreters; software performance evaluation; synchronisation; BSP; Bulk Synchronous Programming software; C program; communication block; compiler; complexity formulas; computation operations; current memory hierarchies; evaluation parameters; experiment design; group partitioning; instrumented code; interactive interpreter; machine instructions; memory access; model oriented profiling; multi-parameter fitting algorithms; oblivious synchronization; parallel program profiling; parameter evaluation; pattern recognition; prediction analysis model; proportionality constant; software support; superstep; trace files; Algorithm design and analysis; Bandwidth; Computer architecture; Delay; Pattern recognition; Predictive models; Program processors; Proposals; Statistics; Timing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel, Distributed and Network-based Processing, 2002. Proceedings. 10th Euromicro Workshop on
  • Conference_Location
    Canary Islands
  • Print_ISBN
    0-7695-1444-8
  • Type
    conf
  • DOI
    10.1109/EMPDP.2002.994212
  • Filename
    994212