CISUC

Using Checkpointing to Enhance Turnaround Time on Institutional Desktop Grids

Authors

Abstract

In this paper, we present a checkpoint-based scheme to improve the turnaround time of bag-of-tasks applications executed on institutional desktop grids. We propose sharing checkpoints among desktop machines to reduce the negative impact of resource volatility. We evaluate several scheduling policies: FCFS, adaptive timeouts, simple replication, replication with checkpoint on demand, and prediction-based checkpointing combined with replication.

We used a set of real traces collected from an academic desktop grid environment to perform trace-driven simulations of the proposed scheduling algorithms. The results show that a shared checkpoint approach may considerably reduce application turnaround time compared with a private checkpoint approach.

Subject

Checkpointing, Desktop grid

Cited by

Year 2010 : 3 citations

 Abbes, H. and Cérin, C. and Jemni, M., “A decentralized and fault-tolerant Desktop Grid system for distributed applications”, Concurrency and Computation: Practice and Experience, volume 22, issue 3, pp. 261-277, 2010

 Yi, S. and Kondo, D. and Andrzejak, A., “Reducing Costs of Spot Instances via Checkpointing in the Amazon Elastic Compute Cloud”, 2010 IEEE 3rd International Conference on Cloud Computing, pp. 236-243, 2010

 Abbes, H. and Butelle, F. and Cérin, C., “Multithreading of Kostka Numbers Computation for the BonjourGrid Meta-desktop Grid Middleware”, Algorithms and Architectures for Parallel Processing, pp. 287-298, 2010

Year 2009 : 2 citations

 Heithem Abbes, Christophe Cérin, Mohamed Jemni, “BonjourGrid: Orchestration of Multi-instances of Grid Middlewares on Institutional Desktop Grids”, Proc. 3rd Workshop on Desktop Grids and Volunteer Computing Systems (PCGrid 2009), Rome, Italy, in Workshops Proceedings of the 23rd IEEE International Parallel and Distributed Processing Symposium, IEEE Computer Society

 Brent Wilson, “Distributed Checkpointing on an Enterprise Desktop Grid”, PhD Thesis, Colorado Technical University, 2009. [PDF] http://www.slideshare.net/brent.wilson/distributed-checkpointing-on-an-enterprise-desktop-grid

Year 2008 : 1 citation

 Abbes, H. and Cérin, C. and Jemni, M., “BonjourGrid as a decentralised job scheduler”, 2008 IEEE Asia-Pacific Services Computing Conference, pp. 89-94, 2008

Year 2007 : 1 citation

 Y. Georgiou, O. Richard, N. Capit, “Evaluations of the Lightweight Grid CIGRI upon the Grid5000 Platform”, IEEE International Conference on e-Science and Grid Computing, pp. 279-286, December 2007