DeepDriveMD: Deep-learning driven adaptive molecular simulations for protein folding

Hyungro Lee, Matteo Turilli, Shantenu Jha, Debsindhu Bhowmik, Heng Ma, Arvind Ramanathan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

55 Scopus citations

Abstract

Simulations of biological macromolecules are important in understanding the physical basis of complex processes such as protein folding. However, even with increasing computational capacity and specialized architectures, the ability to simulate protein folding at atomistic scales still remains challenging. This stems from the dual aspects of high dimensionality of protein conformational landscapes, and the inability of atomistic molecular dynamics (MD) simulations to sufficiently sample these landscapes to observe folding events. Machine learning/deep learning (ML/DL) techniques, when combined with atomistic MD simulations offer the opportunity to potentially overcome these limitations by: (1) effectively reducing the dimensionality of MD simulations to automatically build latent representations that correspond to biophysically relevant reaction coordinates (RCs), and (2) driving MD simulations to automatically sample potentially novel conformational states based on these RCs. We examine how coupling DL approaches with MD simulations can lead to effective approaches to fold small proteins on supercomputers. In particular, we study the computational costs and effectiveness of scaling DL-coupled MD workflows implemented using RADICAL-Cybertools in folding two prototypical systems, namely Fs-peptide and the fast-folding variant of the villin head piece protein. We demonstrate that a DL-coupled MD workflow is able to effectively learn latent representations and drive adaptive simulations. Compared to traditional MD-based approaches, our approach achieves an effective performance gain in sampling the folded states by at least 2.3x. Together, our study provides quantitative basis to understand how coupling DL approaches to MD simulations, can lead to effective performance gains and reduced times to solution on supercomputing resources.

Original languageEnglish
Title of host publicationProceedings of DLS 2019
Subtitle of host publicationDeep Learning on Supercomputers - Held in conjunction with SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages12-19
Number of pages8
ISBN (Electronic)9781728160115
DOIs
StatePublished - Nov 2019
Event3rd IEEE/ACM Workshop on Deep Learning on Supercomputers, DLS 2019 - Denver, United States
Duration: Nov 17 2019 → …

Publication series

NameProceedings of DLS 2019: Deep Learning on Supercomputers - Held in conjunction with SC 2019: The International Conference for High Performance Computing, Networking, Storage and Analysis

Conference

Conference3rd IEEE/ACM Workshop on Deep Learning on Supercomputers, DLS 2019
Country/TerritoryUnited States
CityDenver
Period11/17/19 → …

Funding

This work was supported by NSF DIBBS 1443054, RADICAL-Cybertools NSF 1440677 and 1931512 (Rutgers) and Argonne Laboratory Directed Research and Development Computing Expedition project. This research used resources of the Oak Ridge Leadership Computing Facility at ORNL, supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.

FundersFunder number
National Science Foundation1440677, 1931512, 1443054
U.S. Department of EnergyDE-AC05-00OR22725
Office of Science
Oak Ridge National Laboratory
Laboratory Directed Research and Development

    Keywords

    • Deep learning
    • Machine learning
    • Molecular dynamics
    • Protein folding

    Fingerprint

    Dive into the research topics of 'DeepDriveMD: Deep-learning driven adaptive molecular simulations for protein folding'. Together they form a unique fingerprint.

    Cite this