Anvil - System Architecture and Experiences from Deployment and Early User Operations

  • X. Carol Song
  • , Preston Smith
  • , Rajesh Kalyanam
  • , Xiao Zhu
  • , Eric Adams
  • , Kevin Colby
  • , Patrick Finnegan
  • , Erik Gough
  • , Elizabett Hillery
  • , Rick Irvine
  • , Amiya Maji
  • , Jason St. John

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

36 Scopus citations

Abstract

Anvil is a new XSEDE advanced capacity computational resource funded by NSF. Designed with a systematic strategy to meet the ever increasing and diversifying research needs for advanced computational capacity, Anvil integrates a large capacity high-performance computing (HPC) system with a comprehensive ecosystem of software, access interfaces, programming environments, and composable services in a seamless environment to support a broad range of current and future science and engineering applications of the nation's research community. Anchored by a 1000-node CPU cluster featuring the latest AMD EPYC 3rd generation (Milan) processors, along with a set of 1TB large memory and NVIDIA A100 GPU nodes, Anvil integrates a multi-tier storage system, a Kubernetes composable subsystem, and a pathway to Azure commercial cloud to support a variety of workflows and storage needs. Anvil was successfully deployed and integrated with XSEDE during the world-wide COVID-19 pandemic. Entering production operation in February 2022, Anvil will serve the nation's science and engineering research community for five years. This paper describes the Anvil system and services, including its various components and subsystems, user facing features, and shares the Anvil team's experience through its early user access program from November 2021 through January 2022.

Original languageEnglish
Title of host publicationPEARC 2022 Conference Series - Practice and Experience in Advanced Research Computing 2022 - Revolutionary
Subtitle of host publicationComputing, Connections, You
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9781450391610
DOIs
StatePublished - Jul 8 2022
Externally publishedYes
Event2022 Conference on Practice and Experience in Advanced Research Computing: Revolutionary: Computing, Connections, You, PEARC 2022 - Boston, United States
Duration: Jul 10 2022Jul 14 2022

Publication series

NamePEARC 2022 Conference Series - Practice and Experience in Advanced Research Computing 2022 - Revolutionary: Computing, Connections, You

Conference

Conference2022 Conference on Practice and Experience in Advanced Research Computing: Revolutionary: Computing, Connections, You, PEARC 2022
Country/TerritoryUnited States
CityBoston
Period07/10/2207/14/22

Funding

This material is based upon work supported by the National Science Foundation under Grant No. (OAC-2005632) . The authors wish to thank our early users for sharing their science accomplishments with the Anvil team and approving their inclusion in this paper; and Alexander Younts for his work during the design and acquisition of Anvil.

Fingerprint

Dive into the research topics of 'Anvil - System Architecture and Experiences from Deployment and Early User Operations'. Together they form a unique fingerprint.

Cite this