Spatio-temporal data model for integrating evolving nation-level datasets

Alexandre Sorokine, Robert N. Stewart

Research output: Contribution to journalConference articlepeer-review

Abstract

Ability to easily combine the data from diverse sources in a single analytical workflow is one of the greatest promises of the Big Data technologies. However, such integration is often challenging as datasets originate from different vendors, governments, and research communities that results in multiple incompatibilities including data representations, formats, and semantics. Semantics differences are hardest to handle: different communities often use different attribute definitions and associate the records with different sets of evolving geographic entities. Analysis of global socioeconomic variables across multiple datasets over prolonged time is often complicated by the difference in how boundaries and histories of countries or other geographic entities are represented. Here we propose an event-based data model for depicting and tracking histories of evolving geographic units (countries, provinces, etc.) and their representations in disparate data. The model addresses the semantic challenge of preserving identity of geographic entities over time by defining criteria for the entity existence, a set of events that may affect its existence, and rules for mapping between different representations (datasets). Proposed model is used for maintaining an evolving compound database of global socioeconomic and environmental data harvested from multiple sources. Practical implementation of our model is demonstrated using PostgreSQL object-relational database with the use of temporal, geospatial, and NoSQL database extensions.

Original languageEnglish
Pages (from-to)69-76
Number of pages8
JournalISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Volume4
Issue number4W2
DOIs
StatePublished - Oct 19 2013
Event2nd International Symposium on Spatiotemporal Computing, ISSC 2017 - Cambridge, United States
Duration: Aug 7 2017Aug 9 2017

Funding

This manuscript has been authored by UT-Battelle, LLC under Contract No. DE–AC05–00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/ downloads/doe-public-access-plan). This material is based upon work supported by the U.S. Department of Energy. This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy.

FundersFunder number
LLCDE-AC05-00OR22725
UT-Battelle
U.S. Department of Energy

    Keywords

    • Big data
    • Data model
    • Spatiotemporal databases

    Fingerprint

    Dive into the research topics of 'Spatio-temporal data model for integrating evolving nation-level datasets'. Together they form a unique fingerprint.

    Cite this