Scalable HPC & AI infrastructure for COVID-19 therapeutics

Hyungro Lee, Andre Merzky, Li Tan, Mikhail Titov, Matteo Turilli, Dario Alfe, Agastya Bhati, Alex Brace, Austin Clyde, Peter Coveney, Heng Ma, Arvind Ramanathan, Rick Stevens, Anda Trifan, Hubertus Van Dam, Shunzhou Wan, Sean Wilkinson, Shantenu Jha

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

13 Scopus citations

Abstract

COVID-19 has claimed more than 2.7 × 106 lives and resulted in over 124 × 106 infections. There is an urgent need to identify drugs that can inhibit SARS-CoV-2. We discuss innovations in computational infrastructure and methods that are accelerating and advancing drug design. Specifically, we describe several methods that integrate artificial intelligence and simulation-based approaches, and the design of computational infrastructure to support these methods at scale. We discuss their implementation, characterize their performance, and highlight science advances that these capabilities have enabled.

Original languageEnglish
Title of host publicationProceedings of the Platform for Advanced Scientific Computing Conference, PASC 2021
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9781450385633
DOIs
StatePublished - Jul 5 2021
Event2021 Platform for Advanced Scientific Computing Conference, PASC 2021 - Virtual, Online, Switzerland
Duration: Jul 5 2021Jul 9 2021

Publication series

NameProceedings of the Platform for Advanced Scientific Computing Conference, PASC 2021

Conference

Conference2021 Platform for Advanced Scientific Computing Conference, PASC 2021
Country/TerritorySwitzerland
CityVirtual, Online
Period07/5/2107/9/21

Funding

Research was supported by the DOE Office of Science through the National Virtual Biotechnology Laboratory; as part of the CANDLE project by the ECP (17-SC-20-SC); UK MRC Medical Bioinformatics project (grant no. MR/L016311/1), UKCOMES (grant no. EP/L00030X/1); EU H2020 CompBioMed2 Centre of Excellence (grant no. 823712), and support from the UCL Provost. Access to SuperMUC-NG (LRZ) was made possible by a special COVID-19 allocation award from the Gauss Centre for Supercomputing in Germany. Anda Trifan acknowledges support from the United States Department of Energy through the Computational Sciences Graduate Fellowship (DOE CSGF) under grant number: DE-SC0019323. We acknowledge support and allocation from TACC and OLCF. The effectiveness and impact of the infrastructure are evidenced by its use to sustain a campaign on multiple heterogeneous platforms over months to generate valuable scientific insight (§5). This work is a harbinger of the evolving role of supercomputers, viz., increasingly important generators of data for powerful ML models (e.g., WF1). In general, supercomputers will have to support campaigns with diverse components, viz., physics-based simulations, data generation and analysis, and ML/AI tasks. These individual workflows have different computational characteristics and performance challenges. They encompass high-throughput function calls, ensembles of MPI-based simulations, and AI-driven HPC simulations. There are no “turnkey solutions” to support such campaigns across multiple heterogeneous platforms, with the necessary performance and scale to ensure the required throughput. This has necessitated the design, development, and iterative improvement of infrastructure to advance therapeutics for COVID-19 and beyond. Acknowledgements: Research was supported by the DOE Office of Science through the National Virtual Biotechnology Laboratory; as part of the CANDLE project by the ECP (17-SC-20-SC); UK MRC Medical Bioinformatics project (grant no. MR/L016311/1), UKCOMES (grant no. EP/L00030X/1); EU H2020 CompBioMed2 Centre of Excellence (grant no. 823712), and support from the UCL Provost. Access to SuperMUC-NG (LRZ) was made possible by a special COVID-19 allocation award from the Gauss Centre for Supercomputing in Germany. Anda Trifan acknowledges support from the United States Department of Energy through the Computational Sciences Graduate Fellowship (DOE CSGF) under grant number: DE-SC0019323. We acknowledge support and allocation from TACC and OLCF.

FundersFunder number
Gauss Centre for Supercomputing
H2020 CompBioMed2 Centre of Excellence823712
National Virtual Biotechnology Laboratory17-SC-20-SC
OLCF
TACC
UCL Provost
UKCOMESEP/L00030X/1
U.S. Department of EnergyDE-SC0019323
Office of Science
Medical Research CouncilMR/L016311/1

    Keywords

    • COVID-19
    • Docking molecular dynamics
    • Free energy estimation
    • High-performance computing
    • Machine learning
    • Workflows

    Fingerprint

    Dive into the research topics of 'Scalable HPC & AI infrastructure for COVID-19 therapeutics'. Together they form a unique fingerprint.

    Cite this