Physics and chemistry from parsimonious representations: image analysis via invariant variational autoencoders

Mani Valleti, Maxim Ziatdinov, Yongtao Liu, Sergei V. Kalinin

Research output: Contribution to journal › Article › peer-review

Abstract

Electron, optical, and scanning probe microscopy methods are generating an ever-increasing volume of image data containing information on atomic and mesoscale structures and functionalities. This necessitates the development of machine learning methods for discovering physical and chemical phenomena from the data, such as manifestations of symmetry breaking in electron and scanning tunneling microscopy images, or the variability of nanoparticles. Variational autoencoders (VAEs) are emerging as a powerful paradigm for unsupervised data analysis, making it possible to disentangle the factors of variability and discover optimal parsimonious representations. Here, we summarize recent developments in VAEs, covering their basic principles and the intuition behind them. Invariant VAEs are introduced as an approach to accommodate the scale and translation invariances present in imaging data and to separate known factors of variation from those to be discovered. We further describe the opportunities enabled by control over the VAE architecture, including conditional, semi-supervised, and joint VAEs. Several case studies of VAE applications to toy models and experimental datasets in scanning transmission electron microscopy are discussed, emphasizing the deep connection between VAEs and basic physical principles. The Python codes and datasets discussed in this article are available at https://github.com/saimani5/VAE-tutorials and can be used by researchers as an application guide when applying these methods to their own datasets.
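As a rough illustration of the basic principle the abstract refers to, the sketch below writes out the standard per-sample VAE loss in NumPy: a reconstruction term plus the closed-form KL divergence between a diagonal Gaussian encoder and a standard normal prior, together with the reparameterization step. It is not taken from the authors' repository. The function names, the squared-error reconstruction term (which assumes a unit-variance Gaussian likelihood), and the array shapes are all illustrative assumptions.

```python
import numpy as np


def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, I) (reparameterization trick)."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps


def kl_to_standard_normal(mu, log_var):
    """Closed-form KL(N(mu, diag(sigma^2)) || N(0, I)), summed over latent dimensions."""
    return 0.5 * np.sum(np.exp(log_var) + mu**2 - 1.0 - log_var, axis=-1)


def negative_elbo(x, x_recon, mu, log_var):
    """Per-sample loss: squared-error reconstruction term plus KL term.

    Assumption: the squared error stands in for a unit-variance Gaussian
    likelihood, with additive constants dropped.
    """
    recon = 0.5 * np.sum((x - x_recon) ** 2, axis=-1)
    return recon + kl_to_standard_normal(mu, log_var)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal((4, 16))   # 4 flattened toy "images" of 16 pixels
    mu = np.zeros((4, 2))              # hypothetical encoder means, 2 latent dims
    log_var = np.zeros((4, 2))         # hypothetical encoder log-variances
    z = reparameterize(mu, log_var, rng)
    print(z.shape, negative_elbo(x, x, mu, log_var))
```

In an actual model, `mu`, `log_var`, and `x_recon` would be produced by trainable encoder and decoder networks; here they are fixed arrays so that the loss itself can be inspected in isolation.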

Original language: English
Article number: 183
Journal: npj Computational Materials
Volume: 10
Issue number: 1
State: Published - Dec 2024
