A web service-enabled distributed workflow system for scientific data processing

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

This paper presents the design and implementation of a distributed data-driven workflow system on top of the TeraGrid infrastructure. The workflow system is based on a data management architecture that provides easy access to scientific data collections via the TeraGrid network. The workflow system allows researchers to construct scientific workflows for data discovery, access, transformation, and analysis. The system leverages JOpera, an open-source workflow engine and visual composer, as well as a set of web service-based data and computation modules. To demonstrate its effectiveness, we create an end-to-end climate simulation data analysis workflow that connects the data management architecture to TeraGrid computation resources. We also develop a workflow monitoring service to keep track of distributed workflow execution.

Original languageEnglish
Title of host publicationProceedings - FTDCS 2007
Subtitle of host publication11th IEEE International Workshop on Future Trends of Distributed Computing Systems
Pages7-14
Number of pages8
DOIs
StatePublished - 2007
Externally publishedYes
EventFTDCS 2007: 11th IEEE International Workshop on Future Trends of Distributed Computing Systems - Sedona, AZ, United States
Duration: Mar 21 2007Mar 23 2007

Publication series

NameProceedings of the IEEE Computer Society Workshop on Future Trends of Distributed Computing Systems

Conference

ConferenceFTDCS 2007: 11th IEEE International Workshop on Future Trends of Distributed Computing Systems
Country/TerritoryUnited States
CitySedona, AZ
Period03/21/0703/23/07

Fingerprint

Dive into the research topics of 'A web service-enabled distributed workflow system for scientific data processing'. Together they form a unique fingerprint.

Cite this