Abstract
Linked Science is the practice of inter-connecting scientific assets by publishing, sharing and linking scientific data and processes in end-to-end loosely coupled workflows that allow the sharing and re-use of scientific data. Much of this data does not live in the cloud or on the Web, but rather in multi-institutional data centers that provide tools and add value through quality assurance, validation, curation, dissemination, and analysis of the data. In this paper, we make the case for the use of scientific scenarios in Linked Science. We propose a scenario in river-channel transport that requires biogeochemical experimental data and global climate-simulation model data from many sources. We focus on the use of ontologies-formal machine-readable descriptions of the domain-to facilitate search and discovery of this data. Mercury, developed at Oak Ridge National Laboratory, is a tool for distributed metadata harvesting, search and retrieval. Mercury currently provides uniform access to more than 100,000 metadata records; 30,000 scientists use it each month. We augmented search in Mercury with ontologies, such as the ontologies in the Semantic Web for Earth and Environmental Terminology (SWEET) collection by prototyping a component that provides access to the ontology terms from Mercury. We evaluate the coverage of SWEET for the ORNL Distributed Active Archive Center (ORNL DAAC).
Original language | English |
---|---|
Pages (from-to) | 175-185 |
Number of pages | 11 |
Journal | Earth Science Informatics |
Volume | 6 |
Issue number | 3 |
DOIs | |
State | Published - Sep 2013 |
Funding
Acknowledgments This work has been in part performed at Oak Ridge National Laboratory, Managed by UT Battelle, LLC under Contract No. De-AC05-00OR22725 for the U.S. Department of Energy. Mercury is a tool for distributed metadata harvesting, search, and retrieval originally developed for NASA. Mercury is currently used by projects funded by NASA, USGS, and U.S. Department of Energy (DOE) (Devarakonda et al. 2010). More than 30,000 scientists use Mercury each month. Mercury provides a single portal to search quickly for data and information contained in disparate data-management systems. It collects metadata and key data from contributing project servers distributed around the world and builds a centralized index. Mercury allows data providers to advertise the availability of their data and maintain complete control and ownership of that data. Figure 3 shows a diagram of the Mercury architecture including our added BioPortal module.
Funders | Funder number |
---|---|
National Science Foundation | 0830944 |
U.S. Department of Energy | |
National Aeronautics and Space Administration | |
U.S. Geological Survey | |
Oak Ridge National Laboratory | |
UT-Battelle | De-AC05-00OR22725 |
Keywords
- BioPortal
- Climate change
- Data discovery
- Linked Science
- Ontologies
- Semantic search