minqi

Lucida LabsOxford, UK

Pinned Repositories

dcd
Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.
Language:Python121 6 925
level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.
Language:Python82 9 316
minihack
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Language:Python471 12 4054
minimax
Efficient baselines for autocurricula in JAX.
Language:Python165 6 414
alphazero
Generic implementation of AlphaZero
Language:Python7 3 00
hnatt
Train and visualize Hierarchical Attention Networks
Language:Python202 11 835
learning-to-communicate-pytorch
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
Language:Python342 16 177
procgen
Procgen Benchmark: Procedurally Generated Game-Like Gym Environments
Language:C++1 3 03
PyMDP
Markov decision processes in Python
Language:Python5 2 03
wordcraft
An environment for benchmarking commonsense agents
Language:Python28 3 07

minqi's Repositories

minqi/webreactants
A simple Redux + Express starter project built with Webpack
Language:JavaScript1 3 00
minqi/flux
Application Architecture for Building User Interfaces
Language:JavaScript2 0
minqi/papers-joschu
Collection of papers
2 0
minqi/smartdose
Language:Python3 27
minqi/switch
React Switch
Language:JavaScript2 0
minqi/vimrc
my .vimrc
Language:Vim Script2 0