Toying around with a Rubik's Cube RL Environment. Uses Stable Baselines for now. Curious to see if multihead attention is a valid features extractor.
To install:
git clone https://github.com/drubinstein/rubiks-rl.git
pip install -e .
python -m rubiks_rl.train
python -m rubiks_rl.environment