Racing Control Variable Genetic Programming for Symbolic Regression

09/13/2023
by Nan Jiang, et al.

Symbolic regression, one of the most crucial tasks in AI for science, discovers governing equations from experimental data. Popular approaches based on genetic programming, Monte Carlo tree search, or deep reinforcement learning learn symbolic regression from a fixed dataset. They require massive datasets and long training times, especially when learning complex equations involving many variables. Recently, Control Variable Genetic Programming (CVGP) has been introduced, which accelerates the regression process by discovering equations from designed control variable experiments. However, the set of experiments is fixed a priori in CVGP, and we observe that sub-optimal selection of experiment schedules delays the discovery process significantly. To overcome this limitation, we propose Racing Control Variable Genetic Programming (Racing-CVGP), which carries out multiple experiment schedules simultaneously. A selection scheme similar to that used in selecting good symbolic equations in the genetic programming process is implemented to ensure that promising experiment schedules eventually win over the average ones. The unfavorable schedules are terminated early to save time for the promising ones. We evaluate Racing-CVGP on several synthetic and real-world datasets corresponding to true physics laws. We demonstrate that Racing-CVGP outperforms CVGP and a series of symbolic regressors which discover equations from fixed datasets.
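
The racing idea described above can be illustrated with a toy sketch. This is not the authors' implementation: the function names (`race_schedules`, `toy_score`), the truncation-style selection with a `keep_fraction` parameter, and the scoring rule are all assumptions made for illustration. Here a "schedule" is simply an ordering of variable indices, and a lower score stands in for a better fit of the best equation found so far under that schedule.

```python
import itertools
import random


def race_schedules(schedules, score_fn, rounds, keep_fraction=0.5, seed=0):
    """Toy racing loop (hypothetical): score every surviving schedule each
    round, then terminate the worst-scoring ones early."""
    rng = random.Random(seed)
    alive = list(schedules)
    for round_idx in range(rounds):
        # Lower score is better, e.g. the fitting error reached so far.
        scored = sorted(alive, key=lambda s: score_fn(s, round_idx, rng))
        # Keep only the best fraction; always keep at least one schedule.
        n_keep = max(1, int(len(scored) * keep_fraction))
        alive = scored[:n_keep]
        if len(alive) == 1:
            break
    return alive


def toy_score(schedule, round_idx, rng):
    # Assumed stand-in for a real fitness: schedules that control
    # variable 0 earlier score better, plus a little noise.
    return schedule.index(0) + rng.random() * 0.1


# Every ordering of three variables is one candidate experiment schedule.
schedules = list(itertools.permutations(range(3)))
winners = race_schedules(schedules, toy_score, rounds=3)
```

In an actual system the score would presumably come from running the genetic programming search under each schedule for a limited budget; the sketch only shows how unfavorable schedules can be dropped between rounds so that the remaining budget goes to the promising ones.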


