ILAlgo

Baseline algorithms of imitation learning and reinforcment learning , written in Pytorch. Code is tested on Mujoco.

Note: Code is under development. Issues, bug reports and PRs are welcome.

Install Dependency

pip install -r requirements

Run experiment

# train RL agent
python train_expert.py --config sac.yml --generate-data
# train non-active imitator
python train_il.py --config bc.yml
# train active imitator
python train_active_il.py --config dagger.yml

Watch Result

tensorboard --log_dir out/[your exp name]

Currently Implemented Algorithms

Reinforcement Learning

Imitation Learning

Acknowledgement

During my implementation of IL and RL algorithms, a lot of classic open-source materials on the Internet served as good references. And I highly appreciate their author's effort. Below is a detailed list. By the way, thanks to my friends from LAMDA-RL for their helpful discussions.

RL Library

Open Source Code Repo

Blog

The 37 Implementation Details of Proximal Policy Optimization

Tutorial

OpenAI SpinningUp

mansicer/ILAlgo

ILAlgo

Install Dependency

Run experiment

Currently Implemented Algorithms

Acknowledgement