Fault-tolerant scheduling for differentiated classes of tasks with low replication cost in computational grids

Please use this identifier to cite or link to this item: https://doi.org/10.1145/1272366.1272409

DC Field	Value
dc.title	Fault-tolerant scheduling for differentiated classes of tasks with low replication cost in computational grids
dc.contributor.author	Zheng, Q.
dc.contributor.author	Veeravalli, B.
dc.contributor.author	Tham, C.-K.
dc.date.accessioned	2014-06-19T03:10:48Z
dc.date.available	2014-06-19T03:10:48Z
dc.date.issued	2007
dc.identifier.citation	Zheng, Q.,Veeravalli, B.,Tham, C.-K. (2007). Fault-tolerant scheduling for differentiated classes of tasks with low replication cost in computational grids. Proceedings of the 16th International Symposium on High Performance Distributed Computing 2007, HPDC'07 : 239-240. ScholarBank@NUS Repository. <a href="https://doi.org/10.1145/1272366.1272409" target="_blank">https://doi.org/10.1145/1272366.1272409</a>
dc.identifier.isbn	1595936734
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/70320
dc.description.abstract	Fault-tolerant scheduling is an imperative step for large-scale computational Grid systems, as often geographically distributed nodes co-operate to execute a task. By and large, the primary-backup approach is a common methodology used for fault tolerance where in each task has a primary copy and a backup copy on two different processors. Backup overloading has been proposed to reduce replication cost by allowing the backup copy to overload with other backup copies on the same processor. In this paper, we consider two classes of independent tasks where in both the classes have fault-tolerance requirements. Furthermore, Class 1 tasks require the response time to be as short as possible when a fault occurs, while Class 2 tasks prefer backups with minimum replication cost. We propose two algorithms, called the MRC-ECT algorithm and the MCT-LRC algorithm. Algorithm MRC-ECT is shown to guarantee an optimal backup schedule in terms of replication cost, while MCT-LRCcan schedule a backup with minimum completion time and low replication cost. We conduct extensive simulation experiments to quantify the performance of the proposed algorithms.
dc.description.uri	http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1145/1272366.1272409
dc.source	Scopus
dc.subject	Computational grids
dc.subject	Differentiated tasks
dc.subject	Fault tolerance
dc.subject	Replication cost
dc.subject	Response time
dc.subject	Scheduling
dc.type	Conference Paper
dc.contributor.department	ELECTRICAL & COMPUTER ENGINEERING
dc.description.doi	10.1145/1272366.1272409
dc.description.sourcetitle	Proceedings of the 16th International Symposium on High Performance Distributed Computing 2007, HPDC'07
dc.description.page	239-240
dc.identifier.isiut	NOT_IN_WOS
Appears in Collections:	Staff Publications

Show simple item record

Files in This Item:

There are no files associated with this item.

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Altmetric

Google Scholar^TM