Combining three-dimensional modeling with artificial intelligence to increase specificity and precision in peptide-MHC binding predictions

Michelle P. Aranha, Yead S.M. Jewel, Robert A. Beckman, Louis M. Weiner, Julie C. Mitchell, Jerry M. Parks, Jeremy C. Smith

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

The reliable prediction of the affinity of candidate peptides for the MHC is important for predicting their potential antigenicity and thus influences medical applications, such as decisions on their inclusion in T cell-based vaccines. In this study, we present a rapid, predictive computational approach that combines a popular, sequence-based artificial neural network method, NetMHCpan 4.0, with three-dimensional structural modeling. We find that the ensembles of bound peptide conformations generated by the programs MODELLER and Rosetta FlexPepDock are less variable in geometry for strong binders than for low-affinity peptides. In tests on 1271 peptide sequences for which the experimental dissociation constants of binding to the well-characterized murine MHC allele H-2Db are known, by applying thresholds for geometric fluctuations the structure-based approach in a standalone manner drastically improves the statistical specificity, reducing the number of false positives. Furthermore, filtering candidates generated with NetMHCpan 4.0 with the structure-based predictor led to an increase in the positive predictive value (PPV) of the peptides correctly predicted to bind very strongly (i.e., Kd < 100 nM) from 40 to 52% (p = 0.027). The combined method also significantly improved the PPV when tested on five human alleles, including some with limited data for training. Overall, an average increase of 10% in the PPV was found over the standalone sequence-based method. The combined method should be useful in the rapid design of effective T cell-based vaccines.

Original languageEnglish
Pages (from-to)1962-1977
Number of pages16
JournalJournal of Immunology
Volume205
Issue number7
DOIs
StatePublished - Oct 1 2020

Funding

This work was supported by the Laboratory Directed Research and Development Program at Oak Ridge National Laboratory (ORNL), which is managed by UT-Battelle, LLC, for the U.S. Department of Energy under Contract DE-AC05-00OR22725. This work used resources of the Compute and Data Environment for Science at ORNL and of the National Energy Research Scientific Computing Center, a U.S. Department of Energy Office of Science User Facility. L.M.W. was supported by Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health Grants R01-CA50633 and P30-CA51008.

FundersFunder number
National Institutes of HealthP30-CA51008
U.S. Department of EnergyDE-AC05-00OR22725
National Cancer InstituteR01CA050633
Oak Ridge National Laboratory
Division of Cancer Epidemiology and Genetics, National Cancer Institute
National Energy Research Scientific Computing Center

    Fingerprint

    Dive into the research topics of 'Combining three-dimensional modeling with artificial intelligence to increase specificity and precision in peptide-MHC binding predictions'. Together they form a unique fingerprint.

    Cite this