• DocumentCode
    1659995
  • Title

    Global communication optimization for tensor contraction expressions under memory constraints

  • Author

    Cociorva, Daniel ; Gao, Xiaoyang ; Krishnan, Sandhya ; Baumgartner, Gerald ; Lam, Chi-Chung ; Sadayappan, P. ; Ramanujam, J.

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
  • fYear
    2003
  • Abstract
    The accurate modeling of the electronic structure of atoms and molecules involves computationally intensive tensor contractions involving large multi-dimensional arrays. The efficient computation of complex tensor contractions usually requires the generation of temporary intermediate arrays. These intermediates could be extremely large, but they can often be generated and used in batches through appropriate loop fusion transformations. To optimize the performance of such computations on parallel computers, the total amount of inter-processor communication must be minimized, subject to the available memory on each processor. In this paper, we address the memory-constrained communication minimization problem in the context of this class of computations. Based on a framework that models the relationship between loop fusion and memory usage, we develop an approach to identify the best combination of loop fusion and data partitioning that minimizes inter-processor communication cost without exceeding the per-processor memory limit. The effectiveness of the developed optimization approach is demonstrated on a computation representative of a component used in quantum chemistry suites.
  • Keywords
    atomic structure; chemistry computing; data structures; molecular electronic states; parallel programming; performance evaluation; program control structures; quantum chemistry; tensors; atom electronic structure; data partitioning; global communication optimization; intermediate arrays; large multi-dimensional arrays; loop fusion; memory constraints; memory usage; memory-constrained communication minimization; molecules; parallel computers; performance; quantum chemistry; tensor contraction expressions; Chemistry; Concurrent computing; Constraint optimization; Context; Costs; Fusion power generation; Global communication; Memory management; Quantum computing; Tensile stress
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2003. Proceedings. International
  • ISSN
    1530-2075
  • Print_ISBN
    0-7695-1926-1
  • Type

    conf

  • DOI
    10.1109/IPDPS.2003.1213121
  • Filename
    1213121