Please use this identifier to cite or link to this item: https://doi.org/10.1145/1272366.1272409
DC FieldValue
dc.titleFault-tolerant scheduling for differentiated classes of tasks with low replication cost in computational grids
dc.contributor.authorZheng, Q.
dc.contributor.authorVeeravalli, B.
dc.contributor.authorTham, C.-K.
dc.date.accessioned2014-06-19T03:10:48Z
dc.date.available2014-06-19T03:10:48Z
dc.date.issued2007
dc.identifier.citationZheng, Q.,Veeravalli, B.,Tham, C.-K. (2007). Fault-tolerant scheduling for differentiated classes of tasks with low replication cost in computational grids. Proceedings of the 16th International Symposium on High Performance Distributed Computing 2007, HPDC'07 : 239-240. ScholarBank@NUS Repository. <a href="https://doi.org/10.1145/1272366.1272409" target="_blank">https://doi.org/10.1145/1272366.1272409</a>
dc.identifier.isbn1595936734
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/70320
dc.description.abstractFault-tolerant scheduling is an imperative step for large-scale computational Grid systems, as often geographically distributed nodes co-operate to execute a task. By and large, the primary-backup approach is a common methodology used for fault tolerance where in each task has a primary copy and a backup copy on two different processors. Backup overloading has been proposed to reduce replication cost by allowing the backup copy to overload with other backup copies on the same processor. In this paper, we consider two classes of independent tasks where in both the classes have fault-tolerance requirements. Furthermore, Class 1 tasks require the response time to be as short as possible when a fault occurs, while Class 2 tasks prefer backups with minimum replication cost. We propose two algorithms, called the MRC-ECT algorithm and the MCT-LRC algorithm. Algorithm MRC-ECT is shown to guarantee an optimal backup schedule in terms of replication cost, while MCT-LRCcan schedule a backup with minimum completion time and low replication cost. We conduct extensive simulation experiments to quantify the performance of the proposed algorithms.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1145/1272366.1272409
dc.sourceScopus
dc.subjectComputational grids
dc.subjectDifferentiated tasks
dc.subjectFault tolerance
dc.subjectReplication cost
dc.subjectResponse time
dc.subjectScheduling
dc.typeConference Paper
dc.contributor.departmentELECTRICAL & COMPUTER ENGINEERING
dc.description.doi10.1145/1272366.1272409
dc.description.sourcetitleProceedings of the 16th International Symposium on High Performance Distributed Computing 2007, HPDC'07
dc.description.page239-240
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.