Related papers

Related Papers on the Web

B. Janssens and W. K. Fuchs, ``The Performance of Cache-Based Error Recovery in Multiprocessors,'' IEEE Trans. on Parallel and Distributed Systems, Vol. 5, No. 10, Oct. 1994, pp. 1033--1043.

B. Janssens and W. K. Fuchs, ``Ensuring Correct Rollback Recovery in Distributed Shared Memory Systems,'' Journal of Parallel and Distributed Computing (Special Issue on Distributed Shared Memory), Oct. 1995, to appear.

G. Suri, B. Janssens, and W. K. Fuchs, ``Reduced Overhead Logging for Rollback Recovery in Distributed Shared Memory,'' Proc. 25th Int. Symp. on Fault-Tolerant Computing, June 1995, to appear.

A.-M. Kermarrec, G. Cabillic, A. Gefflaut, C. Morin, and I. Puaut, ``A recoverable distributed shared memory integrating coherence and recoverability,'' Tech. Report 897, IRISA, Rennes, France, Jan. 1995.

M. J. Feeley, J. S. Chase, V. Narasayya, and H. M. Levy, ``Integrating coherency and recovery in distributed systems,'' Proc. Symp. on Operating Systems Design and Implementation, Nov. 1994.

A. Gefflaut, C. Morin, and M. Banâtre, ``Tolerating node failures in cache only memory architectures,'' Proc. Supercomputing '94, Nov. 1994.

B. Janssens and W. K. Fuchs, ``Reducing interprocessor dependence in recoverable distributed shared memory,'' Proc. 13th Symp. on Reliable Distributed Systems, Oct. 1994, pp. 34--41.

G. Janakiraman and Y. Tamir, ``Coordinated checkpointing-rollback error recovery for distributed shared memory multicomputers,'' Proc. 13th Symp. on Reliable Distributed Systems, Oct. 1994, pp. 42--51.

N. Neves, M. Castro, and P. Guedes, ``A checkpoint protocol for an entry consistent shared memory system,'' Proc. 13th ACM Symp. on Principles of Distributed Computing, Aug. 1994.

T. Fuchi and M. Tokoro, ``A mechanism for recoverable shared virtual memory,'' manuscript, U. of Tokyo, May 1994.

G. G. Richard III and M. Singhal, ``Using logging and asynchronous checkpointing to implement recoverable distributed shared memory,'' Proc. 12th Symp. on Reliable Distributed Systems, 1993, pp. 58--67.

L. Gunaseelan and R. J. LeBlanc, ``Event ordering in a shared memory distributed system,'' Proc. 13th Int. Conf. on Distributed Computing Systems, 1993, pp. 256--263.

B. Janssens and W. K. Fuchs, ``Relaxing consistency in recoverable distributed shared memory,'' Proc. 23rd Int. Symp. on Fault-Tolerant Computing, July 1993, pp. 155--163.

B. D. Fleisch, ``Reliable distributed shared memory,'' Proc. 2nd IEEE Workshop on Experimental Distributed Systems, 1990, pp. 102--105.

M. Stumm and S. Zhou, ``Fault tolerant distributed shared memory,'' Proc. 2nd IEEE Symp. on Parallel and Distributed Processing, 1990, pp. 719--724.

V.-O. Tam and M. Hsu, ``Fast recovery in distributed shared virtual memory systems,'' Proc. 10th Int. Conf. on Distributed Computing Systems, 1990, pp. 38--45.

K.-L. Wu and W. K. Fuchs, ``Recoverable distributed shared virtual memory,'' IEEE Trans. on Computers, Vol. 39, No. 4, Apr. 1990, pp. 460--469.

M. Banâtre, A. Gefflaut, P. Joubert, P. Lee, and C. Morin, ``An architecture for tolerating processor failures in shared-memory multiprocessors,'' Tech. Report 707, IRISA, Rennes, France, Mar. 1993.

P. Joubert, Conception et évaluation d'une architecture multiprocesseur à mémoire partagée tolérante aux fautes, Doctoral Thesis, l'Université de Rennes 1, Jan. 1993.

M. S. Algudady, C. R. Das, and M. J. Thazhuthaveetil, ``A cache-based checkpointing scheme for MIN-based multiprocessors,'' Proc. Int. Conf. on Parallel Processing, Aug. 1991, pp. I-497--I-500.

B. Janssens and W. K. Fuchs, ``Experimental evaluation of multiprocessor cache-based error recovery methods,'' Proc. Int. Conf. on Parallel Processing, Aug. 1991, pp. I-505--I-508.

B. Janssens, ``Generation of multiprocessor address traces and their use in the performance analysis of cache-based error recovery methods,'' MS Thesis, UIUC, Tech. Report CRHC-91-10.

R. E. Ahmed, R. C. Frazier, and P. N. Marinos, ``Cache-Aided Rollback Error Recovery (CARER) algorithms for shared-memory multiprocessor systems,'' Proc. 20th Int. Symp. on Fault-Tolerant Computing, 1990, pp. 82--88.

M. Banâtre and P. Joubert, ``Cache management in a tightly coupled fault tolerant multiprocessor,'' Proc. 20th Int. Symp. on Fault-Tolerant Computing, June 1990, pp. 89--96.

K.-L. Wu, W. K. Fuchs, and J. H. Patel, ``Error recovery in shared memory multiprocessors using private caches,'' IEEE Trans. on Parallel and Distributed Systems, Vol. 1, No. 2, Apr. 1990, pp. 231--240.

P. A. Bernstein, ``The Sequoia system,'' Lecture Notes in Computer Science: Fault-Tolerant Distributed Computing, B. Simons and A. Spector, Eds., No. 448, pp. 217--223.

P. A. Bernstein, ``Sequoia: a fault-tolerant tightly coupled multiprocessor for transaction processing,'' Computer, Vol. 21, No. 2, Feb. 1988, pp. 37--45.

Y.-H. Lee and K. G. Shin, ``Design and evaluation of a fault-tolerant multiprocessor using hardware recovery blocks,'' IEEE Trans. on Computers, Vol. C-33, No. 2, Feb. 1984, pp. 113--124.