Symbolic Regression using Mixed-Integer Nonlinear Optimization

dc.contributor.authorAustel, Vernon
dc.contributor.authorCornelio, Cristina
dc.contributor.authorDash, Sanjeeb
dc.contributor.authorGoncalves, Joao
dc.contributor.authorHoresh, Lior
dc.contributor.authorJosephson, Tyler R.
dc.contributor.authorMegiddo, Nimrod
dc.date.accessioned2021-03-09T19:01:13Z
dc.date.available2021-03-09T19:01:13Z
dc.date.issued2020-06-11
dc.description.abstractThe Symbolic Regression (SR) problem, where the goal is to find a regression function that does not have a pre-specified form but is any function that can be composed of a list of operators, is a hard problem in machine learning, both theoretically and computationally. Genetic programming based methods, that heuristically search over a very large space of functions, are the most commonly used methods to tackle SR problems. An alternative mathematical programming approach, proposed in the last decade, is to express the optimal symbolic expression as the solution of a system of nonlinear equations over continuous and discrete variables that minimizes a certain objective, and to solve this system via a global solver for mixed-integer nonlinear programming problems. Algorithms based on the latter approach are often very slow. We propose a hybrid algorithm that combines mixed-integer nonlinear optimization with explicit enumeration and incorporates constraints from dimensional analysis. We show that our algorithm is competitive, for some synthetic data sets, with a state-of-the-art SR software and a recent physics-inspired method called AI Feynman.en_US
dc.description.sponsorshipTyler Josephson was primarily supported by the U.S. Department of Energy (DOE), Office of Basic Energy Sciences, Division of Chemical Sciences, Geosciences and Biosciences under Award DE-FG02- 17ER16362. Tyler Josephson and Lior Horesh gratefully acknowledge the support of the Institute for Mathematics and its Applications (IMA), where a part of this work was initiated.en_US
dc.description.urihttps://arxiv.org/abs/2006.06813en_US
dc.format.extent9 pagesen_US
dc.genrejournal articles preprintsen_US
dc.identifierdoi:10.13016/m2oifx-idbq
dc.identifier.citationAustel, Vernon; Cornelio, Cristina; Dash, Sanjeeb; Goncalves, Joao; Horesh, Lior; Josephson, Tyler R.; Megiddo, Nimrod; Symbolic Regression using Mixed-Integer Nonlinear Optimization; Machine Learning (2020); https://arxiv.org/abs/2006.06813en_US
dc.identifier.urihttp://hdl.handle.net/11603/21127
dc.language.isoen_USen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Chemical, Biochemical & Environmental Engineering Department Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.titleSymbolic Regression using Mixed-Integer Nonlinear Optimizationen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2006.06813.pdf
Size:
358.57 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: