Publications

Detailed Information

A recovery mechanism for errors caused by a late subjob in a system handling SLA-based Grid workflows

DC Field Value Language
dc.contributor.authorQuan, Dang Minh-
dc.contributor.authorAltmann, Jorn-
dc.date.accessioned2009-08-10T04:40:19Z-
dc.date.available2009-08-10T04:40:19Z-
dc.date.issued2008-05-
dc.identifier.citationInt. J. Web and Grid Services, Vol.4, No.1, pp.35-62en
dc.identifier.issn1741-1106 (print)-
dc.identifier.issn1741-1114 (online)-
dc.identifier.urihttps://hdl.handle.net/10371/6766-
dc.description.abstractSupporting SLAs (Service Level Agreements) for Grid-based
workflows requires providing mechanisms for handling errors (i.e., the
failures of subjobs). In the context of this paper, we propose an error
recovery mechanism which can handle one failed subjob of a workflow. The
error recovery mechanism has a maximum of three phases, depending on the
impact of the error. In each phase, we use a dedicated algorithm to remap
the subjobs of the workflow to the resources. The main contributions of the
paper are the error recovery mechanism for SLA-based workflows and
the mapping algorithm G-map, which is used in the first phase of the recovery
mechanism. The G-map remaps the groups of subjobs, which are directly
affected by an error. The efficiency of the proposed algorithm is validated
through simulation results.
en
dc.language.isoenen
dc.publisherInderscienceen
dc.subjectGrid computingen
dc.subjectService Level Agreementen
dc.subjectSLAen
dc.subjectGrid-based workflowen
dc.subjecterror recoveryen
dc.titleA recovery mechanism for errors caused by a late subjob in a system handling SLA-based Grid workflowsen
dc.typeArticleen
dc.identifier.doi10.1504/IJWGS.2008.018493-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share