ECHOVIT: Vision Transformers Using Fast-And-Slow Time Embeddings

Oluwanisola Ibikunle, Debvrat Varshney, Jilu Li, Maryam Rahnemoonfar, John Paden

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

This paper details the preliminary efforts of applying the deep learning transformer architecture to automatically track annual layer stratigraphy in echogram images obtained from mapping near-surface ice layers using airborne radars. Following the success of the transformer architecture in the natural language processing and computer vision communities, we explore a variant termed Echogram Vision Transformer (EchoViT) on the radar echogram layer tracking (RELT) problem. The proposed approach divides the echogram images into patches using different schemes inspired by tokenization methods in natural language processing. We then apply a soft-attention mechanism to model interdependencies between the patches, capturing spatiotemporal stratigraphic information. Experiments conducted on the CREED dataset demonstrate the superiority of transformer-based architectures over existing convolutional-based architectures. Furthermore, the EchoViT fast-time and EchoViT slow-time patchifying schemes achieved precise tracking of the layers with submeter MAE of 3.39 and 3.55, respectively, while the use of cropped patches led to suboptimal results.

Original languageEnglish
Title of host publicationIGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages8162-8165
Number of pages4
ISBN (Electronic)9798350320107
DOIs
StatePublished - 2023
Externally publishedYes
Event2023 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2023 - Pasadena, United States
Duration: Jul 16 2023Jul 21 2023

Publication series

NameInternational Geoscience and Remote Sensing Symposium (IGARSS)
Volume2023-July

Conference

Conference2023 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2023
Country/TerritoryUnited States
CityPasadena
Period07/16/2307/21/23

Funding

Center for Remote Sensing and Integrated Systems (CReSIS) is the new name of the Center for Remote Sensing and Ice Sheets (CReSIS). This work is supported by NSF IIS-1838236 and ACI-1443054.

Keywords

  • deep learning
  • EchoViT
  • soft-attention
  • transformer
  • ViT

Fingerprint

Dive into the research topics of 'ECHOVIT: Vision Transformers Using Fast-And-Slow Time Embeddings'. Together they form a unique fingerprint.

Cite this