Approaches to Distributed Execution of Scientific Workflows in Kepler
Citation of Original Publication
Płóciennik, Marcin, Tomasz Żok, Ilkay Altintas, Jianwu Wang, Daniel Crawl, David Abramson, Frederic Imbeaux, et al. “Approaches to Distributed Execution of Scientific Workflows in Kepler.” Fundamenta Informaticae 128, no. 3 (January 1, 2013): 281–302. https://doi.org/10.3233/FI-2013-947.
Rights
© [Płóciennik, Marcin; Żok, Tomasz; Altintas, Ilkay; Wang, Jianwu; Crawl, Daniel; Abramson, David; Imbeaux, Frederic; Guillerminet, Bernard; Lopez-Caniego, Marcos; Plasencia, Isabel Campos; Pych, Wojciech; Ciecieląg, Pawel; Palak, Bartek; Owsiak, Michał; Frauel, Yann, 2013]. The definitive, peer-reviewed and edited version of this article is published in Fundamenta Informaticae, vol. 128, no. 3, pp. 281–302, 2013, https://doi.org/10.3233/FI-2013-947.
Abstract
The Kepler scientific workflow system enables the creation, execution and sharing of workflows across a broad range of scientific and engineering disciplines, while also facilitating remote and distributed execution of workflows. In this paper, we present and compare different approaches to distributed execution of workflows using the Kepler environment, including a distributed data-parallel framework using Hadoop and Stratosphere, and Cloud and Grid execution using Serpens, Nimrod/K and Globus actors. We also present real-life applications in computational chemistry, bioinformatics and computational physics to demonstrate the use of Kepler's different distributed computing capabilities in executable workflows. We further analyze the differences among the approaches and provide guidance on their application.