Machine Learning-Enhanced Computational Reverse-Engineering Analysis for Scattering Experiments (CREASE) for Analyzing Fibrillar Structures in Polymer Solutions

  • Zijie Wu
  • , Arthi Jayaraman

Research output: Contribution to journalArticlepeer-review

24 Scopus citations

Abstract

In this work, we present a machine learning (ML)-enhanced computational reverse-engineering analysis of scattering experiments (CREASE) approach to analyze the small-angle scattering profiles from polymer solutions with assembled semiflexible fibrils with dispersity in fibril diameters (e.g., aqueous solutions of methylcellulose fibrils). This work is an improvement over the original CREASE method [ Beltran-Villegas, D. J.; et al. J. Am. Chem. Soc., 2019, 141, 14916-14930 ], which identifies relevant dimensions of assembled structures in polymer solutions from their small-angle scattering profiles without relying on traditional analytical models. Here, we improve the original CREASE approach by incorporating ML for analyzing assembled semiflexible fibrillar structures with disperse fibril diameters. We first validate our CREASE approach without ML by taking as input the scattering profiles of in silico structures with known dimensions (diameter, Kuhn length) and reproducing as output those known dimensions within error. We then show how the incorporation of ML (specifically an artificial neural network, denoted as NN) within the CREASE approach improves the speed of workflow without sacrificing the accuracy of the determined fibrillar dimensions. Finally, we apply NN-enhanced CREASE to experimental small-angle X-ray scattering profiles from methylcellulose fibrils obtained by Lodge, Bates, and co-workers [ Schmidt, P. W.; et al. Macromolecules, 2018, 51, 7767-7775 ] to determine fibril diameter distribution and compare NN-enhanced CREASE's output with their fibril diameter distribution fitted using analytical models. The diameter distributions of methylcellulose fibrils from NN-enhanced CREASE are similar to those obtained from analytical model fits, confirming the results by Lodge, Bates, and co-workers that methylcellulose form fibrils with consistent average diameters of ∼15-20 nm regardless of the molecular weight of methylcellulose chains. The successful implementation of NN-enhanced CREASE in handling experimental scattering profiles of complex macromolecular assembled structures with dispersity in dimensions demonstrates its potential for application toward other unconventional fibrillar systems that may not have appropriate analytical models.

Original languageEnglish
Pages (from-to)11076-11091
Number of pages16
JournalMacromolecules
Volume55
Issue number24
DOIs
StatePublished - Dec 27 2022
Externally publishedYes

Funding

The authors acknowledge financial support from the National Science Foundation, DMR CMMT Grant #2105744. The authors are grateful to Professor Timothy Lodge, Dr. Lucy L. Solomon, and Professor Frank Bates for graciously providing raw experimental SAXS data and useful feedback. The computational work in this paper is supported using the Caviness supercomputing cluster at the University of Delaware and DARWIN supercomputing cluster at the University of Delaware, the latter supported by the National Science Foundation, Grant #1919839.

Fingerprint

Dive into the research topics of 'Machine Learning-Enhanced Computational Reverse-Engineering Analysis for Scattering Experiments (CREASE) for Analyzing Fibrillar Structures in Polymer Solutions'. Together they form a unique fingerprint.

Cite this