Abstract
Rapid proliferation of hyperspectral imaging in scanning probe microscopy creates unique opportunities to systematically capture and categorize higher dimensional datasets, toward insights into electronic, mechanical, and chemical properties of materials with nano- and atomic-scale resolution. Effective hyperspectral imaging requires a consistent framework for data analysis that would be broadly applicable, reproducible, and transferrable, conceptually resembling the success of integral transforms in image analysis. Here, we demonstrate application of similarity learning for resolving the structure of tunneling spectroscopy data, characterizing a superconducting material with sparse density of defects. Popular methods for unsupervised learning and discrete representation of the data in terms of clusters of characteristic behaviors were found to produce inconsistencies with respect to capturing the location and tunneling characteristics of defect sites. The underlying reason for their ambiguity was traced to continuous variation of the electronic properties across the surface and therefore the absence of clear structural boundaries in the low-dimensional latent spaces of the data. We supported this hypothesis by direct analysis of the distributions of Euclidean distances within the dataset. We further proposed distance rescaling with probabilistic description as a possible approach to mitigate the detrimental effect of the long tails of the distributions on the performance of clustering methods. Subsequently, we applied a more general, nonlinear similarity learning, where dimension reduction was explicitly trained to amplify similarities and dissimilarities among individual spectra. This approach was found to outperform several widely used methods for dimensionality reduction and produce a clear categorization of tunneling spectra. Significant spectral weight transfer associated with the electronic reconstruction by the vacancy sites was systematically captured, as was the spatial extent of the vacancy region. Given that a great variety of electronic materials will exhibit similarly smooth variation of spectral response due to random or engineered inhomogeneities, we believe our approach will be useful for systematic analysis of hyperspectral imaging with minimal prior knowledge as well as prospective comparison of experimental measurements with theoretical calculations with explicit consideration of disorder.
Original language | English |
---|---|
Article number | 033058 |
Journal | Physical Review Research |
Volume | 4 |
Issue number | 3 |
DOIs | |
State | Published - Jul 2022 |
Funding
We thank Rama Vasudevan for discussion and improvement of the manuscript. Research sponsored by Division of Materials Science and Engineering, Basic Energy Sciences, Office of Science, U.S. Department of Energy (DOE). Experiments were carried out as part of a user project at the Center for Nanophase Materials Sciences, ORNL, a U.S. DOE of Science User Facility. This paper used resources of the Compute and Data Environment for Science (CADES) at the ORNL, which is supported by the Office of Science of the U.S. DOE under Contract No. DE-AC05-00OR22725. ORNL is managed by UT-Battelle, LLC, for the U.S. DOE. This paper is a contribution of the U.S. Government, not subject to U.S. copyright. The DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan .
Funders | Funder number |
---|---|
CADES | |
Data Environment for Science | |
U.S. Department of Energy | |
Office of Science | |
Basic Energy Sciences | |
Oak Ridge National Laboratory | DE-AC05-00OR22725 |
Division of Materials Sciences and Engineering |