Logo image
Quantifying and improving the availability of cooperative cluster-based internet services
Technical documentation   Open access

Quantifying and improving the availability of cooperative cluster-based internet services

Kiran Nagaraja, Neeraj Krishnan, Ricardo Bianchini, Richard P. Martin and Thu Nguyen
Rutgers University
2003
DOI:
https://doi.org/10.7282/t3-k4r1-4890

Abstract

Much research has shown that cluster-based servers can substantially increase performance when nodes cooperate to share and globally manage their resources. In this paper, we apply a quantification methodology to show that this performance increase has a corresponding substantial cost in availability. Specifically, we show that a sophisticated cluster-based web server gains a factor of 3 in performance when nodes cooperate to balance load and jointly manage their memories, but also suffers an increase in unavailability of a factor of 10. We then show how this web server can be augmented with Commercial Off-The-Shelf (COTS) components embodying a small set of high-availability techniques to regain the lost availability. Among other interesting observations, we show that the application of multiple high-availability techniques, each implemented independently in its own subsystem, can lead to inconsistent recovery actions. We also show that a novel technique called Fault Model Enforcement can be used to resolve such inconsistencies. Augmenting the server with these techniques led to a final predicted availability of close to 99.99.
pdf
dcs-tr-517321.95 kBDownloadView
Version of Record (VoR) Technical Documentation Open Access
url
Report an accessibility issueView
Please complete a content remediation request to report an accessibility issue with a library electronic resource, website, or service.

Metrics

67 File downloads
62 Record Views

Details

Logo image