Pinned Repositories
dcd
Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.
level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.
minihack
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
minimax
Efficient baselines for autocurricula in JAX.
alphazero
Generic implementation of AlphaZero
hnatt
Train and visualize Hierarchical Attention Networks
learning-to-communicate-pytorch
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
procgen
Procgen Benchmark: Procedurally Generated Game-Like Gym Environments
PyMDP
Markov decision processes in Python
wordcraft
An environment for benchmarking commonsense agents
minqi's Repositories
minqi/webreactants
A simple Redux + Express starter project built with Webpack
minqi/flux
Application Architecture for Building User Interfaces
minqi/papers-joschu
Collection of papers
minqi/smartdose
minqi/switch
React Switch
minqi/vimrc
my .vimrc