• DocumentCode
    1451804
  • Title

    The Hector distributed run-time environment

  • Author

    Russ, Samuel H. ; Robinson, Jonathan ; Flachs, Brian K. ; Heckel, Bjorn

  • Author_Institution
    Eng. Res. Center, Mississippi State Univ., MS, USA
  • Volume
    9
  • Issue
    11
  • fYear
    1998
  • fDate
    11/1/1998 12:00:00 AM
  • Firstpage
    1102
  • Lastpage
    1114
  • Abstract
    Harnessing the computational capabilities of a network of workstations promises to off-load work from overloaded supercomputers onto largely idle resources overnight. Several capabilities are needed to do this, including support for an architecture-independent parallel programming environment, task migration, automatic resource allocation, and fault tolerance. The Hector distributed run-time environment is designed to present these capabilities transparently to programmers. MPI programs can be run under this environment on homogeneous clusters with no modifications to their source code needed. The design of Hector, its internal structure, and several benchmarks and tests are presented
  • Keywords
    fault tolerant computing; parallel programming; programming environments; resource allocation; Hector distributed run-time environment; MPI programs; architecture-independent parallel programming environment; automatic resource allocation; benchmarks; fault tolerance; network of workstations; task migration; Availability; Computer networks; Fault tolerance; Load management; Parallel programming; Programming profession; Resource management; Runtime environment; Supercomputers; Workstations;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/71.735957
  • Filename
    735957