• DocumentCode
    1142497
  • Title

    Design and evaluation of a fault-tolerant mobile-agent system

  • Author

    Lyu, Michael R. ; Chen, Xinyu ; Wong, Tsz Yeung

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Chinese Univ. of Hong Kong, China
  • Volume
    19
  • Issue
    5
  • fYear
    2004
  • Firstpage
    32
  • Lastpage
    38
  • Abstract
    The mobile agents create a new paradigm for data exchange and resource sharing in rapidly growing and continually changing computer networks. In a distributed system, failures can occur in any software or hardware component. A mobile agent can get lost when its hosting server crashes during execution, or it can get dropped in a congested network. Therefore, survivability and fault tolerance are vital issues for deploying mobile-agent systems. This fault tolerance approach deploys three kinds of cooperating agents to detect server and agent failures and recover services in mobile-agent systems. An actual agent is a common mobile agent that performs specific computations for its owner. Witness agents monitor the actual agent and detect whether it´s lost. A probe recovers the failed actual agent and the witness agents. A peer-to-peer message-passing mechanism stands between each actual agent and its witness agents to perform failure detection and recovery through time-bounded information exchange; a log records the actual agent´s actions. When failures occur, the system performs rollback recovery to abort uncommitted actions. Moreover, our method uses checkpointed data to recover the lost actual agent.
  • Keywords
    client-server systems; fault tolerant computing; message passing; mobile agents; system recovery; checkpointed data; data exchange; failure detection; failure recovery; fault tolerance; mobile agent system; peer-to-peer message passing mechanism; resource sharing; rollback recovery; survivability; Computer crashes; Computer networks; Condition monitoring; Fault detection; Fault tolerant systems; Hardware; Mobile agents; Network servers; Probes; Resource management; fault tolerance; mobile agent; probe; stochastic Petri net; witness agent; witnessing dependency;
  • fLanguage
    English
  • Journal_Title
    Intelligent Systems, IEEE
  • Publisher
    ieee
  • ISSN
    1541-1672
  • Type

    jour

  • DOI
    10.1109/MIS.2004.40
  • Filename
    1347066