Characterizing the Effects of Random Subsampling on Lexicase Selection

This repository holds the code behind our work for the 2019 Genetic Programming Theory and Practice (GPTP) workshop.

All code is modified and simplified from GECCO 2019 code by Alexander Lalejini and Jose Hernandez here: https://github.com/amlalejini/GECCO-2019-cohort-lexicase

Background

Previous work has shown that by applying random subsampling to lexicase selection, we can reduce the number of evaluations needed to acheive satisfactory results. However, while the theoretical differences between subsampled methods (e.g., cohort and downsampled lexicase) are obvious, in practice they seem to perform similarly. The aim of this work is to characterize the differences of these selection techniques to help guide when they should be used.

Reference

The tag-based linear genetic programming system used in this work is the same as previous work, and can be found here. (Note that we used a larger population size of 1,000 candidate solutions.)

All diversity maintenance techniques come from the paper "Quantifying the Tape of Life: Ancestry-based Metrics Provide Insights and Intuition about Evolutionary Dynamics" from ALife 2018.

Setup

There are two dependecies for this repository to run:

Empirical - A branch of the Empirical library.

git clone git@github.com:emilydolson/Empirical.git
git checkout memic_model

csv-parser

git clone git@github.com:AriaFallah/csv-parser.git

Once both dependencies are downloaded, the makefile (in the repo's root) needs be edited (should be the first two lines) so the compiler can find them:

EMP_DIR := PATH_TO_YOUR_EMPIRICAL_DIR/source
PARSER_DIR := PATH_TO_YOUR_CSV_PARSER

After that the makefile should do the trick!

make
./gptp2019

FergusonAJ/gptp-2019-subsampled-lexicase

Characterizing the Effects of Random Subsampling on Lexicase Selection

Background

Directory

Reference

Setup