Bibliography as of August 1998

Journals, Book Chapters and Trade Publications
  1. Message logging: pessimistic, optimistic, causal, and optimal. IEEE Transactions on Software Engineering, Feb. 1998, 24(2):149-59 (with L. Alvisi).
  2. Efficient Detection of a Class of Stable Properties. Distributed Computing 8(2):81-91, 1994. (with L. Sabel).
  3. Refinement for Fault-Tolerance: An Aircraft Handoff Protocol. In Foundations of Ultradependable Computing (G. Koob and C. L. Lau, editors), Volume 2, Chapter 1.2, pp. 39-54. Kluwer Academic Press (with F. B. Schneider and J. Dehn).
  4. Consistent Detection of Global Predicates. In Global States and Time in Distributed Systems (Z. Yang and T. Marsland, editors), IEEE Computer Society Press 1994, pp. 37-43 (with R. C. B. Cooper).
  5. Priority Inversion and Its Prevention. Journal of Real-time Systems 5, 285-303 (1993) (with O. Babaoglu and F. B. Schneider).
  6. Consistent Global States of Distributed Systems: Fundamental Concepts and Mechanisms. In Distributed Systems (S. Mullender, editor), Chapter 2. ACM Press, 1993 (with O. Babaoglu).
  7. The Primary-Backup Approach. In Distributed Systems (S. Mullender, editor), Chapter 2. ACM Press, 1993 (with N. Budhiraja, F. B. Schneider and S. Toueg).
  8. The ISIS distributed programming toolkit and the Meta distributed operating system: a brief overview. In Mission critical operating systems. (edited by A. K. Agrawala, K. D. Gordon, K.D. and P. Hwang). Amsterdam, Netherlands: IOS Press, 1992. p. 32-35 (with K. Birman).
  9. Derivation of Sequential, Real-Time, Process-Control Programs. In Foundations of Real-Time Computing: Formal Specifications and Methods, Kluwer Academic Publishers 1991, pp 39-54 (with N. Budhiraja and F. B. Schnieider).
  10. Tools for Monitoring and Controlling Distributed Applications.IEEE Computer, Volume 24, number 8 (August 1991), pp 42-51. (with K. Birman, R. Cooper and M. Wood).
  11. Tolerating Failures of Continuous-Valued Sensors. ACM Transactions on Computer Systems Volume 8, number 4 (November 1990), pp 284-304.
  12. ISIS and the Meta Project. Sun Technology, 2, Number 3 (Summer 1989) (with K. Birman).
Conference and Workshop Proceedings
  1. The Bancomat problem: an example of resource allocation in a wide-area distributed system. In Proceedings of DISC '98, October 1998, to appear (with J. Sussman).
  2. WAFT: support for fault-tolerance in wide-area object oriented systems, In Information Survivability Workshop '98, Orlando, Florida, October 1998, to appear (with L. Alvisi).
  3. Wide-area Nile: a case study of a wide-area data-parallel application.  In Proceeding 18th International Conference on Distributed Computing Systems, Amsterdam, Netherlands, 26-29 May 1998, pp. 506-515 (with A, Amoroso and A. Ricciardi).
  4. A low-cost processor group membership protocol for a hard real-time distributed system. In Proceedings 18th IEEE Real-Time Systems Symposium, San Francisco, CA, USA, 2-5 Dec. 1997, pp. 90-98 (with M. Clegg).
  5. Predicting physical processes in the presence of faulty sensor readings. In Digest of Papers, Twenty-Seventh Annual International Symposium on Fault-Tolerant Computing, Seattle, WA, USA, 24-27 June 1997, pp. 373-378 (with M. Clegg).
  6. NILE: Wide-Area Computing for High-Energy Physics. Proceedings of the Seventh ACM SIGOPS European Workshop, Connemara, Ireland, 9-11 September 1996, pp. 49-54 (with M. Ogg, A. Ricciardi, A. Amoroso, F. Andrew Calkins, and E. Rothfus).
  7. Trade-offs in implementing causal message logging protocols. In Proceedings of the Fifteenth Annual ACM Symposium on Principles of Distributed Computing, Philadelphia, PA, USA, 23-26 May 1996, pp. 58-67 (with L. Alvisi).
  8. Comparing primary-backup and state machines for crash failures. In Proceedings of the Fifteenth Annual ACM Symposium on Principles of Distributed Computing, Philadelphia, PA, USA, 23-26 May 1996, p. 90 (with J. Sussman, short paper).
  9. The NILE System Architectures. Proceedings of the Eleventh International Conference on Systems Engineering, Las Vegas, USA, 9-11 July 1996, pp. 414-419 (with M. Ogg and A. Ricciardi).
  10. Tradeoffs in Implementing Primary-Backup Protocols. Proceedings of the Seventh IEEE Symposium on Parallel and Distributed Processing, October 1995, pp. 280-288 (with N. Budhiraja).
  11. Deriving optimal checkpoint protocols for distributed shared memory architectures. Brief Announcements, Proceedings of the Fourteenth Annual ACM Symposium on Principles of Distributed Computing, Ottawa, Ontario, Canada, 20--23 August 1995, pp. 263 (with L. Alvisi).
  12. Deriving optimal checkpoint protocols for distributed shared memory architectures. Selected Papers, International Workshop in Theory and Practice in Distributed Systems, K. Birman, F. Mattern and A. Schiper, editors, Springer-Verlag 1995, pp. 111-120 (with L. Alvisi).
  13. Message Logging: Pessimistic, Optimistic, Causal and Optimal. Proceedings of the Fifteenth International Conference on Distributed Computing Systems, May 1995, pp. 229-236 (with L. Alvisi).
  14. Simulating Fail-Stop in Asynchronous Distributed Systems. Proceedings of the Thirteenth Symposium on Reliable Distributed Systems, October 1994, pp. 138-147 (with L. Sabel).
  15. Nonblocking and Orphan-Free Message Logging Protocols. Proceedings of the 23rd Fault-Tolerant Computing Symposium, June 1993, pp. 145-154 (with L. Alvisi and B. Hoppe).
  16. Primary-Backup Protocols: Lower Bounds and Optimal Implementations. Proceedings of the Third IFIP Conference on Dependable Computing for Critical Applications, Mondello, Sicily, September 1992, pp. 321-343 (with N. Budhiraja, F. B. Schneider and S. Toueg).
  17. Monitoring and Controlling Distributed Applications using Lomita. Proceedings of the First Workshop on Systems Management, Los Angeles CA, May 1993, pp. 45-53 (with I. Szafranska).
  18. High Availability in a Real-Time System. Proceedings of the European SIGOPS Workshop, published in ACM Operating Systems Review, 27, 2, (April 1993), 82-87 (position paper with C. Almeida, B. Glade and R. van Renesse).
  19. Optimal primary-backup protocols. Proceedings of the International Workshop on Distributed Algorithms (WDAG '92) Haifa, Israel, 2-4 Nov. 1992). Edited by: Segall, A.; Zaks, S. Berlin, Germany: Springer-Verlag, 1992. p. 362-378.
  20. Putting time into proof outlines. Real-Time: Theory in Practice. REX Workshop Proceedings, Mook, Netherlands, 3-7 June 1991). Edited by: de Bakker, J.W.; Huizing, C.; de Roever, W.P.; Rozenberg, G. Berlin, Germany: Springer-Verlag, 1992, pp 618-639 (with B. Bloom and F. B. Schneider).
  21. Lower bounds for primary-backup implementations of Bofo Services. Proceedings ONR 2nd Annual Workshop on Ultradependable Multicomputers and Electronic Systems, Washington D.C. November 1991, pp 81-86 (with Navin Budhiraja, Fred B. Schneider and Sam Toueg).
  22. Detection of Global State Predicates. Proceedings of the Fifth Workshop on Distributed Algorithms and Graphs (Springer-Verlag LNCS 579) pp 254-272, Delphi, Greece, October 1991 (with G. Neiger).
  23. Using Consistent Subcuts to detect Stable Properties. Proceedings of the Fifth Workshop on Distributed Algorithms and Graphs (Springer-Verlag LNCS 579) pp 273-288, Delphi, Greece, October 1991 (with L. Sabel).
  24. Masking Failures of Multidimensional Sensors. Proceedings of the Tenth Symposium on Reliable Distributed Systems, Pisa, Italy, October 1991, pp 32-41 (with P. Chew).
  25. Tools for Distributed Application Management. In Proceedings of the Spring 1991 EurOpen Conference, Tromso, Norway, May 1991, pp 185-196 (with M. Wood).
  26. Consistent Detection of Global Predicates. Proceedings of the ACM/ONR Workshop on Parallel and Distributed Debugging, Santa Cruz, CA, May 1991 (with R. Cooper), pp 163-173.
  27. Making Real-Time Reactive Systems Reliable. Proceedings of the Fourth ACM SIGOPS European Workshop, September 1990 (position paper with M. Wood).
  28. Deceit: A Flexible Distributed File System. Proceedings of the Summer 1990 USENIX Conference, June 1990, pp 51-61 (with K. Birman and A. Siegel).
  29. Supplying High Availability with a Standard Network File System. Proceedings Eighth International Conference on Distributing Computing Systems, San Jose, CA., June 1988, (with F. Schmuck).
  30. Configuration Management for Modular Systems. International Workshop on Software Version and Configuration Control, Grassau, FRG, January 1988 (position paper).
  31. RNFS: A Highly Available Network File Service. Workshop on Workstation Operating Systems, Boston, MA., November 1987 (with S. Armstrong, et.al.)
  32. Jasmine: A Software System Modeling Facility. Proceedings 2nd ACM SIGSOFT/SIGPLAN Software Engineering Conference on Practical Software Development Environments, December 1986 (with D. Wiebe).
  33. Maintaining the Time in a Distributed System. Second ACM SIGACT-SIGOPS Symposium on the Principals of Distributed Computing, August 1983 (with S. Owicki).
  34. Multicast Transport Protocol. Internet RFC 1301, February 1992 (with S. Armstrong and A. Freier).
  35. Election Vs. Consensus in Asynchronous System. University of California, San Diego Department of Computer Science and Engineering Technical Report CS95-411, February 1995 (with L. Sabel).
  36. Clock Synchronization in Hard Real-Time Distributed Systems University of California, San Diego Department of Computer Science and Engineering Technical Report CS96-478, February 1996 (with M. Clegg).