Matti A Hiltunen et al 1999 Distrib. Syst. Engng. 6 103 doi:10.1088/0967-1846/6/3/302
Matti A Hiltunen, Vijaykumar Immanuel and Richard D Schlichting
Show affiliationsThe cost of employing software fault tolerance techniques in distributed systems is strongly related to the type of failures to be tolerated. For example, in terms of the amount of redundancy required and execution time, tolerating a processor crash is much cheaper than tolerating arbitrary (or Byzantine) failures. This paper describes an approach to constructing configurable services for distributed systems that allows easy customization of the type of failures to tolerate. Using this approach, it is possible to configure custom services across a spectrum of possibilities, from a very efficient but unreliable server group that does not tolerate any failures, to a less efficient but reliable group that tolerates crash, omission, timing, or arbitrary failures. The approach is based on building configurable services as collections of software modules called micro-protocols. Each micro-protocol implements a different semantic property or property variant, and interacts with other micro-protocols using an event-driven model provided by a runtime system. In addition to facilitating the choice of failure model, the approach allows service properties such as message ordering and delivery atomicity to be customized for each application.
07.05.Bx Computer systems: hardware, operating systems, computer languages, and utilities
84.40.Ua Telecommunications: signal transmission and processing; communication satellites
Issue 3 (September 1999)
Received 16 August 1999
Matti A Hiltunen et al 1999 Distrib. Syst. Engng. 6 103
B G Vaucher et al 1995 Metrologia 32 331
J T Lau et al 2002 New J. Phys. 4 98
A T Holster 2003 New J. Phys. 5 130
M V Berry 2002 New J. Phys. 4 74
A N Tripathi et al 1969 J. Phys. B: At. Mol. Phys. 2 155
A Dargys et al 1997 J. Phys.: Condens. Matter 9 L557
R S Davis 1995 Meas. Sci. Technol. 6 227
M Stock and R Goebel 2000 Metrologia 37 633
C. L. Bennett et al. 2003 ApJS 148 1