TY - JOUR
T1 - Dynamics-Based Peptide-MHC Binding Optimization by a Convolutional Variational Autoencoder
T2 - A Use-Case Model for CASTELO
AU - Bell, David R.
AU - Domeniconi, Giacomo
AU - Yang, Chih Chieh
AU - Zhou, Ruhong
AU - Zhang, Leili
AU - Cong, Guojing
N1 - Publisher Copyright:
© 2021 The Authors. Published by American Chemical Society.
PY - 2021/12/14
Y1 - 2021/12/14
N2 - An unsolved challenge in the development of antigen-specific immunotherapies is determining the optimal antigens to target. Comprehension of antigen-major histocompatibility complex (MHC) binding is paramount toward achieving this goal. Here, we apply CASTELO, a combined machine learning-molecular dynamics (ML-MD) approach, to identify per-residue antigen binding contributions and then design novel antigens of increased MHC-II binding affinity for a type 1 diabetes-implicated system. We build upon a small-molecule lead optimization algorithm by training a convolutional variational autoencoder (CVAE) on MD trajectories of 48 different systems across four antigens and four HLA serotypes. We develop several new machine learning metrics including a structure-based anchor residue classification model as well as cluster comparison scores. ML-MD predictions agree well with experimental binding results and free energy perturbation-predicted binding affinities. Moreover, ML-MD metrics are independent of traditional MD stability metrics such as contact area and root-mean-square fluctuations (RMSF), which do not reflect binding affinity data. Our work supports the role of structure-based deep learning techniques in antigen-specific immunotherapy design.
AB - An unsolved challenge in the development of antigen-specific immunotherapies is determining the optimal antigens to target. Comprehension of antigen-major histocompatibility complex (MHC) binding is paramount toward achieving this goal. Here, we apply CASTELO, a combined machine learning-molecular dynamics (ML-MD) approach, to identify per-residue antigen binding contributions and then design novel antigens of increased MHC-II binding affinity for a type 1 diabetes-implicated system. We build upon a small-molecule lead optimization algorithm by training a convolutional variational autoencoder (CVAE) on MD trajectories of 48 different systems across four antigens and four HLA serotypes. We develop several new machine learning metrics including a structure-based anchor residue classification model as well as cluster comparison scores. ML-MD predictions agree well with experimental binding results and free energy perturbation-predicted binding affinities. Moreover, ML-MD metrics are independent of traditional MD stability metrics such as contact area and root-mean-square fluctuations (RMSF), which do not reflect binding affinity data. Our work supports the role of structure-based deep learning techniques in antigen-specific immunotherapy design.
UR - http://www.scopus.com/inward/record.url?scp=85119913248&partnerID=8YFLogxK
U2 - 10.1021/acs.jctc.1c00870
DO - 10.1021/acs.jctc.1c00870
M3 - Article
C2 - 34793168
AN - SCOPUS:85119913248
SN - 1549-9618
VL - 17
SP - 7962
EP - 7971
JO - Journal of Chemical Theory and Computation
JF - Journal of Chemical Theory and Computation
IS - 12
ER -