
An Efficient Checkpointing Algorithm for Mobile Distributed Systems

Surender Kumar, R.K. Chauhan, Parveen Kumar

Abstract


A mobile distributed system is a distributed system that contains mobile nodes as well as static nodes. Because of the unique characteristics of mobile nodes, such as mobility, limited bandwidth on wireless links, limited battery power, limited local storage, frequent disconnection, and handoff, traditional fault-tolerance schemes cannot be applied directly to mobile distributed systems. Non-blocking coordinated checkpointing algorithms that force only a minimum number of processes to take checkpoints are the preferred approach for such systems. Achieving both the non-blocking and the minimum-process goals, however, sometimes requires piggybacking extra information onto normal messages or taking some useless checkpoints. During checkpointing, a large amount of checkpoint data, including all variables, the environment, control information, and register values, is transmitted over the wireless links. Forced or useless checkpoints therefore increase the checkpointing overhead, because they too are transmitted over the wireless link and consume considerable communication bandwidth and energy. Our efforts are therefore directed mainly at avoiding useless checkpoints. In this paper, we propose a minimum-process, non-blocking coordinated checkpointing algorithm that takes no useless checkpoints and produces a consistent global state with a minimum number of checkpoints. This approach can save significant battery energy and wireless communication bandwidth.
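The "minimum number of processes" idea can be illustrated with a small sketch. This is not the algorithm proposed in the paper, which the abstract does not specify; it only shows the general notion, common in the minimum-process checkpointing literature, that an initiator's checkpoint forces a checkpoint on every process it depends on, directly or transitively. The function name `minimum_checkpoint_set`, the `depends_on` map, and the process names are hypothetical, and a process is assumed to depend on any process from which it has received a message since its last checkpoint.

```python
from collections import deque


def minimum_checkpoint_set(initiator, depends_on):
    """Return the processes that must checkpoint together with `initiator`.

    Assumption (illustrative only): depends_on[p] is the set of processes
    from which p has received a message since p's last checkpoint.
    """
    must_checkpoint = {initiator}
    queue = deque([initiator])
    while queue:
        p = queue.popleft()
        # Every sender that p depends on must also record its send events,
        # otherwise p's checkpoint would contain an orphan message.
        for q in depends_on.get(p, ()):
            if q not in must_checkpoint:
                must_checkpoint.add(q)
                queue.append(q)
    return must_checkpoint


# Hypothetical dependency map: P1 received from P2, P2 from P3, P4 from P1.
deps = {"P1": {"P2"}, "P2": {"P3"}, "P3": set(), "P4": {"P1"}}
result = minimum_checkpoint_set("P1", deps)
```

In this example, P4 is not forced to checkpoint when P1 initiates, because P1 has received nothing from P4. Avoiding such unnecessary checkpoints is what saves wireless bandwidth and battery energy on mobile nodes.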

Keywords


Distributed Systems, Mobile Distributed Systems, Domino Effect, Checkpointing, Coordinated, Uncoordinated






Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.