Abstract :
Distributed computing, in which an application runs over multiple independent computing nodes, has a higher risk of one or more nodes failing than a centralized, single-node environment. On the other hand, distributed computing can also make an overall system more dependable by detecting those faulty nodes - whether they´re due to an underlying hardware or software failure or to compromised security through malicious attacks and then redistributing application components or coordinating them via predefined protocols to avoid such problems. So, traditional dependability studies focus on fault detection, protocols for redistributing application components and coordinating them across nodes, and even failure estimation using system and component characterization.
Keywords :
distributed processing; distributed computing; reusable services; runtime management; service-oriented computing; Availability; Distributed computing; Distributed databases; Fault detection; Fault tolerant systems; Hardware; Maintenance; Protocols; Runtime; Security; dependable service-oriented computing; service-level agreements; service-oriented architectures;