Abstract
Understanding and exploiting topographical data via standard machine learning techniques is challenging, mainly due to the large dynamic range of values present in elevation data and the lack of direct relationships between anthropogenic phenomena and topography, when considering topographic-geology couplings, for instance. Here we consider the first hurdle, dynamic range, in an effort to apply Convolutional Neural Network (CNN) approaches for prediction of human activity. CNN for learning 3-D elevation data relies on data normalization approaches, which only consider locally available points, thereby discarding contextual information and eliminating global contrast cues. We present a fully invertible and data-driven global partitioning elevation normalization (GPEN) preprocessing technique, which is intended to ameliorate the impact of limited data dynamic range. Global elevation populations are derived and used to formulate a distribution, which is used to adopt a partitioning scheme to remap all values according to global occurrence frequency, while preserving partition contrast. Using USGS 3-D Elevation Project and Microsoft building footprint data, we conduct a binary classification experiment predicting building footprint presence from elevation data, with and without a global remapping using the SegNet convolutional encoder-decoder model. The results of the experiment show more rapid model convergence, reduced regionalization errors, and enhanced classification metrics when compared to standard normalization preprocessing techniques. GPEN demonstrates performance over 10% higher than the next best conventional preprocessing method, with a mean overall accuracy of 94.76%. GPEN may show promise as an alternative normalization for deep learning with topological data.
Original language | English |
---|---|
Article number | 9117159 |
Pages (from-to) | 3493-3502 |
Number of pages | 10 |
Journal | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
Volume | 13 |
DOIs | |
State | Published - 2020 |
Funding
Manuscript received February 25, 2020; revised May 18, 2020; accepted June 10, 2020. Date of publication June 15, 2020; date of current version July 2, 2020. This work was supported in part by the Science Education and Workforce Development Programs at Oak Ridge National Laboratory, administered by ORISE, through the U.S. Department of Energy Oak Ridge Institute for Science and Education. (Corresponding author: Alexander Fafard.) Alexander Fafard and Jan van Aardt are with the Department of Imaging Science, Rochester Institute of Technology, Rochester, NY 14623 USA (e-mail: [email protected]; [email protected]).
Funders | Funder number |
---|---|
Oak Ridge National Laboratory |
Keywords
- Machine vision
- data processing
- optical data processing
- remote sensing