Approaches to Distributed Execution of Scientific Workflows in Kepler

Date

Department

Program

Citation of Original Publication

Płóciennik, Marcin, Tomasz Żok, Ilkay Altintas, Jianwu Wang, Daniel Crawl, David Abramson, Frederic Imbeaux, et al. “Approaches to Distributed Execution of Scientific Workflows in Kepler.” Fundamenta Informaticae 128, no. 3 (January 1, 2013): 281–302. https://doi.org/10.3233/FI-2013-947.

Rights

© [Płóciennik, Marcin; Żok, Tomasz; Altintas, Ilkay; Wang, Jianwu; Crawl, Daniel; Abramson, David; Imbeaux, Frederic; Guillerminet, Bernard; Lopez-Caniego, Marcos; Plasencia, Isabel Campos; Pych, Wojciech; Ciecieląg, Pawel; Palak, Bartek; Owsiak, Michał; Frauel, Yann, 2013 ]. The definitive, peer reviewed and edited version of this article is published in Fundamenta Informaticae, vol. 128, no. 3, pp. 281-302, 2013, https://doi.org/10.3233/FI-2013-947.

Abstract

The Kepler scientific workflow system enables creation, execution and sharing of workflows across a broad range of scientific and engineering disciplines while also facilitating remote and distributed execution of workflows. In this paper, we present and compare different approaches to distributed execution of workflows using the Kepler environment, including a distributed data-parallel framework using Hadoop and Stratosphere, and Cloud and Grid execution using Serpens, Nimrod/K and Globus actors. We also present real-life applications in computational chemistry, bioinformatics and computational physics to demonstrate the usage of different distributed computing capabilities of Kepler in executable workflows. We further analyze the differences of each approach and provide a guidance for their applications.