• DocumentCode
    1684685
  • Title

    Collaborative fault diagnosis in grids through automated tests

  • Author

    Duarte, Alexandre ; Brasileiro, Francisco ; Cirne, Walfredo ; Filho, José Alencar

  • Author_Institution
    Univ. Fed. de Campina Grande, Brazil
  • Volume
    1
  • fYear
    2006
  • Abstract
    Grids have the potential to revolutionize computing by providing ubiquitous, on demand access to computational services and resources. However, grid systems are extremely large, complex and prone to failures. A survey we have conducted reveals that fault diagnosis is still a major problem for grid users. When a failure appears at the user screen, it becomes very difficult for the user to identify whether the problem is in his application, somewhere in the grid middleware, or even lower in the fabric that comprises the grid. To overcome this problem, we argue that current grid platforms must be augmented with a collaborative diagnosis mechanism. We propose for such mechanism to use automated tests to identify the root cause of a failure and propose the appropriate fix. We also present a Java-based implementation of the proposed mechanism, which provides a simple and flexible framework that eases the development and maintenance of the automated tests.
  • Keywords
    Java; automatic testing; computational complexity; fault diagnosis; grid computing; middleware; ubiquitous computing; Java-based implementation; automated tests; collaborative fault diagnosis; computational services; grid middleware; grid systems; ubiquitous computing; Automatic testing; Collaboration; Computer crashes; Fabrics; Fault diagnosis; Grid computing; Java; Middleware; Parallel processing; Pervasive computing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications, 2006. AINA 2006. 20th International Conference on
  • ISSN
    1550-445X
  • Print_ISBN
    0-7695-2466-4
  • Type

    conf

  • DOI
    10.1109/AINA.2006.127
  • Filename
    1620172