Analysing Symbolic Regression Benchmarks under a Meta-Learning Approach

Analysing Symbolic Regression Benchmarks under a Meta-Learning Approach

Luiz Otavio V. B. Oliveira, Joao Francisco B. S. Martins, Luis F. Miranda, Gisele L. Pappa Universidade Federal de Minas Gerais, Department of Computer ScienceBelo HorizonteBrazil [luizvbo, joaofbsm, luisfmiranda, glpappa]

The definition of a concise and effective testbed for Genetic Programming (GP) is a recurrent matter in the research community. This paper takes a new step in this direction, proposing a different approach to measure the quality of the symbolic regression benchmarks quantitatively. The proposed approach is based on meta-learning and uses a set of dataset meta-features—such as the number of examples or output skewness—to describe the datasets. Our idea is to correlate these meta-features with the errors obtained by a GP method. These meta-features define a space of benchmarks that should, ideally, have datasets (points) covering different regions of the space. An initial analysis of 63 datasets showed that current benchmarks are concentrated in a small region of this benchmark space. We also found out that number of instances and output skewness are the most relevant meta-features to GP output error. Both conclusions can help define which datasets should compose an effective testbed for symbolic regression methods.

copyright: acmlicenseddoi: isbn: 978-x-xxxx-xxxx-x/YY/MMconference: GECCO ’18; July 15-19, 2018; Kyoto, Japanjournalyear: 2018price: 15.00journalyear: 2018copyright: acmlicensedconference: Genetic and Evolutionary Computation Conference Companion; July 15–19, 2018; Kyoto, Japanprice: 15.00doi: 10.1145/3205651.3208293isbn: 978-1-4503-5764-7/18/07
Comments 0
Request Comment
You are adding the first comment!
How to quickly get a good reply:
  • Give credit where it’s due by listing out the positive aspects of a paper before getting into which changes should be made.
  • Be specific in your critique, and provide supporting evidence with appropriate references to substantiate general statements.
  • Your comment should inspire ideas to flow and help the author improves the paper.

The better we are at sharing our knowledge with each other, the faster we move forward.
The feedback must be of minimum 40 characters and the title a minimum of 5 characters
Add comment
Loading ...
This is a comment super asjknd jkasnjk adsnkj
The feedback must be of minumum 40 characters
The feedback must be of minumum 40 characters

You are asking your first question!
How to quickly get a good answer:
  • Keep your question short and to the point
  • Check for grammar or spelling errors.
  • Phrase it like a question
Test description