Abstract
Collaborative science demands global sharing of scientific data. But it cannot leverage universally accessible cloud-based infrastructures like Drop Box, as those offer limited interfaces and inadequate levels of access bandwidth. We present the Scibox cloud facility for online sharing scientific data. It uses standard cloud storage solutions, but offers a usage model in which high end codes can write/read data to/from the cloud via the APIs they already use for their I/O actions. With Scibox, data upload/download volumes are controlled via Data Reduction-functions stated by end users and applied at the data source, before data is moved, with further gains in efficiency obtained by combining DR-functions to move exactly what is needed by current data consumers. We evaluate Scibox with science applications and their representative data analytics - the GTS fusion and the combustion image processing - demonstrating the potential for ubiquitous data access with substantial reductions in network traffic.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - IEEE 28th International Parallel and Distributed Processing Symposium, IPDPS 2014 |
| Publisher | IEEE Computer Society |
| Pages | 145-154 |
| Number of pages | 10 |
| ISBN (Print) | 9780769552071 |
| DOIs | |
| State | Published - 2014 |
| Event | 28th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2014 - Phoenix, AZ, United States Duration: May 19 2014 → May 23 2014 |
Publication series
| Name | Proceedings of the International Parallel and Distributed Processing Symposium, IPDPS |
|---|---|
| ISSN (Print) | 1530-2075 |
| ISSN (Electronic) | 2332-1237 |
Conference
| Conference | 28th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2014 |
|---|---|
| Country/Territory | United States |
| City | Phoenix, AZ |
| Period | 05/19/14 → 05/23/14 |
Keywords
- Cloud Storage
- Data Sharing
- Scientific Data