Fault-tolerant execution of large parameter sweep applications across multiple VOs with storage constraints

Ayyub, Shahaan, Abramson, David, Enticott, Colin, Garic, Slavisa and Tan, Jefferson (2009). Fault-tolerant execution of large parameter sweep applications across multiple VOs with storage constraints. In: Special Issue: The Best of CCGrid'2007: A Snapshot of an ‘Adolescent’ Area. 7th IEEE International Symposium on Cluster Computing and the Grid, Rio de Janeiro Brazil, (377-392). 14-17 May 2007. doi:10.1002/cpe.1353

Attached Files (Some files may be inaccessible until you login with your UQ eSpace credentials)
Name Description MIMEType Size Downloads

Author Ayyub, Shahaan
Abramson, David
Enticott, Colin
Garic, Slavisa
Tan, Jefferson
Title of paper Fault-tolerant execution of large parameter sweep applications across multiple VOs with storage constraints
Conference name 7th IEEE International Symposium on Cluster Computing and the Grid
Conference location Rio de Janeiro Brazil
Conference dates 14-17 May 2007
Proceedings title Special Issue: The Best of CCGrid'2007: A Snapshot of an ‘Adolescent’ Area   Check publisher's open access policy
Journal name Concurrency Computation Practice and Experience   Check publisher's open access policy
Place of Publication Chichester, West Sussex, United Kingdom
Publisher John Wiley & Sons Ltd.
Publication Year 2009
Year available 2009
Sub-type Fully published paper
DOI 10.1002/cpe.1353
Open Access Status
ISSN 1532-0626
Volume 21
Issue 3
Start page 377
End page 392
Total pages 16
Collection year 2010
Language eng
Abstract/Summary Applications that span multiple virtual organizations (VOs) are of great interest to the e-science community. However, our recent attempts to execute large-scale parameter sweep applications (PSAs) for real-world climate studies with the Nimrod/G tool have exposed problems in the areas of fault tolerance, data storage and trust management. In response, we have implemented a task-splitting approach that facilitates breaking up large PSAs into a sequence of dependent subtasks, improving fault tolerance; provides a garbage collection technique that deletes unnecessary data; and employs a trust delegation technique that facilitates flexible third party data transfers across different VOs. Copyright
Subjects 1705 Computer Networks and Communications
1706 Computer Science Applications
1712 Software
1703 Computational Theory and Mathematics
2614 Theoretical Computer Science
Keyword E-science
Parameter sweep applications&Grid
Q-Index Code E1
Q-Index Status Provisional Code
Institutional Status Non-UQ

 
Versions
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 1 times in Thomson Reuters Web of Science Article | Citations
Scopus Citation Count Cited 2 times in Scopus Article | Citations
Google Scholar Search Google Scholar
Created: Thu, 19 Dec 2013, 12:09:22 EST by Ms Diana Cassidy on behalf of Research Computing Centre