Kepler + CometCloud: Dynamic Scientific Workflow Execution on Federated Cloud Resources

Date

2016-06-01

Department

Program

Citation of Original Publication

Wang, Jianwu, Moustafa AbdelBaky, Javier Diaz-Montes, Shweta Purawat, Manish Parashar, and Ilkay Altintas. “Kepler + CometCloud: Dynamic Scientific Workflow Execution on Federated Cloud Resources.” Procedia Computer Science, International Conference on Computational Science 2016, ICCS 2016, 6-8 June 2016, San Diego, California, USA, 80 (January 1, 2016): 700–711. https://doi.org/10.1016/j.procs.2016.05.363.

Rights

This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
Attribution-NonCommercial-NoDerivs 4.0 International (CC BY-NC-ND 4.0 DEED)

Abstract

The widespread availability and variety of cloud offerings and their associated access models has drastically grown over the past few years. It is now common for users to have access to multiple infrastructures (e.g., campus clusters, cloud resources), however, deploying complex application workflows on top of these resources remains a challenge. In this paper we propose an approach that allows users to build and run scientific workflows on top of a federation of multiple clouds and traditional resources (e.g., clusters). We achieve this by integrating the Kepler scientific workflow platform with the CometCloud framework. This allows us to: 1) dynamically and programmatically provision and aggregate resources, 2) easily compose complex workflows, and 3) dynamically schedule and execute these workflows based on provenance and overall objectives on the resulting federation of resources. We demonstrate our approach and evaluate its capabilities by running a bioinformatics workflow on top of a federation composed of a campus cluster and two clouds.