Metrics for Benchmarking and Uncertainty Quantification: Quality, Applicability, and Best Practices for Machine Learning in Chemistry

Gaurav Vishwakarma, Aditya Sonpal, Johannes Hachmann

Research output: Contribution to journalReview articlepeer-review

44 Scopus citations

Abstract

This review aims to draw attention to two issues of concern when we set out to make machine learning work in the chemical and materials domain, that is, statistical loss function metrics for the validation and benchmarking of data-derived models, and the uncertainty quantification of predictions made by them. They are often overlooked or underappreciated topics as chemists typically only have limited training in statistics. Aside from helping to assess the quality, reliability, and applicability of a given model, these metrics are also key to comparing the performance of different models and thus for developing guidelines and best practices for the successful application of machine learning in chemistry.

Original languageEnglish
Pages (from-to)146-156
Number of pages11
JournalTrends in Chemistry
Volume3
Issue number2
DOIs
StatePublished - Feb 2021
Externally publishedYes

Funding

This work was supported by the NSF CAREER program under grant No. OAC-1751161 and the NSF Big Data Spokes program under grant No. IIS-1761990 .

Keywords

  • benchmarking
  • machine learning
  • model validation
  • statistical loss function
  • uncertainty quantification

Fingerprint

Dive into the research topics of 'Metrics for Benchmarking and Uncertainty Quantification: Quality, Applicability, and Best Practices for Machine Learning in Chemistry'. Together they form a unique fingerprint.

Cite this