Learning Symbolic Expressions: Mixed-Integer Formulations, Cuts, and Heuristics

Jongeun Kim, Sven Leyffer, Prasanna Balaprakash

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

In this paper, we consider the problem of learning a regression function without assuming its functional form. This problem is referred to as symbolic regression. An expression tree is typically used to represent a solution function, which is determined by assigning operators and operands to the nodes. Cozad and Sahinidis propose a nonconvex mixed-integer nonlinear program (MINLP), in which binary variables are used to assign operators and nonlinear expressions are used to propagate data values through nonlinear operators, such as square, square root, and exponential. We extend this formulation by adding new cuts that improve the solution of this challenging MINLP. We also propose a heuristic that iteratively builds an expression tree by solving a restricted MINLP. We perform computational experiments and compare our approach with a mixed-integer program-based method and a neural network-based method from the literature.

Original languageEnglish
Pages (from-to)1383-1403
Number of pages21
JournalINFORMS Journal on Computing
Volume35
Issue number6
DOIs
StatePublished - Nov 2023

Funding

Funding:This work was supported by the Applied Mathematics activity within the U.S. Department of Energy, Office of Science, Advanced Scientific Computing Research [Grant DE-AC02-06CH11357].

FundersFunder number
U.S. Department of Energy
Office of Science
Advanced Scientific Computing ResearchDE-AC02-06CH11357

    Keywords

    • expression tree
    • local branching heuristic
    • mixed-integer nonlinear programming
    • symbolic regression

    Fingerprint

    Dive into the research topics of 'Learning Symbolic Expressions: Mixed-Integer Formulations, Cuts, and Heuristics'. Together they form a unique fingerprint.

    Cite this