This paper presents a fault tolerant protocol for distributed Time Warp simulation. Based on the concept of global virtual time, we show that a distributed snapshot of Time Warp can be efficiently taken. A set of simple distributed snapshot algorithms and fault recovery algorithms are proposed. The distributed snapshot algorithms checkpoint the system states (distributed snapshots) from time to time. The fault recovery algorithms restore the system state from the most recent distributed snapshot taken by the distributed snapshot algorithms. This protocol is robust enough to tolerate failures occurring at any moment.
|Number of pages||11|
|Journal||Journal of Information Science and Engineering|
|State||Published - 1 Jun 1994|