Characterising the Double Descent of Symbolic Regression

Grant Dick; Caitlin Owen

doi:10.1145/3638530.3664176

Back

Conference proceeding

Characterising the Double Descent of Symbolic Regression

Grant Dick and Caitlin Owen

Proceedings of the Genetic and Evolutionary Computation Conference Companion, pp.2050-2057

GECCO '24 Companion: Genetic and Evolutionary Computation Conference Companion

ACM Conferences

14/07/2024

DOI: https://doi.org/10.1145/3638530.3664176

Abstract

Computing methodologies -- Machine learning -- Learning paradigms -- Supervised learning -- Supervised learning by regression

Computing methodologies -- Machine learning -- Machine learning approaches -- Bio-inspired approaches -- Genetic programming

Recent work has argued that many machine learning techniques exhibit a 'double descent' in model risk, where increasing model complexity beyond an interpolation zone can overcome the bias-variance tradeoff to produce large, over-parameterised models that generalise well to unseen data. While the double descent characteristic has been identified in many learning methods, it has not been explored within symbolic regression research. This paper presents an initial exploration into the presence of double descent behaviour in symbolic regression over a range of parameter settings. Results suggest that symbolic regression via genetic programming does not exhibit a clear double descent risk curve relative to model size or function set. Unlike other methods, models evolved through symbolic regression do not appear to strongly interpolate training data, which promotes a degree of robustness towards noise in training data. However, models evolved by symbolic regression can still be large and do not present a strong overfitting characteristic. Given that a prime motivation for symbolic regression is to produce compact interpretable models, these results suggest that methods aimed at regularising evolved models should be a key feature of all symbolic regression methods.

Metrics

1 Record Views

Details

Record Identifier: 9926552000901891
Title: Characterising the Double Descent of Symbolic Regression
Creators: Grant Dick
Caitlin Owen
Publication Details: Proceedings of the Genetic and Evolutionary Computation Conference Companion, pp.2050-2057
Conference: GECCO '24 Companion: Genetic and Evolutionary Computation Conference Companion
Academic Unit: School of Computing
Publisher: ACM
Date published ; e-published: 14/07/2024
Language: English
Resource Type; Subtype: Conference proceeding

Characterising the Double Descent of Symbolic Regression

Abstract

Related links

Metrics

Details

Usage Policy