hnekoeiq
Interested in Deep/Reinforcement Learning, Game Theory and Information Theory.
@mila-iqiaMontreal
Pinned Repositories
Lifelong-Hanabi
A Continual Multi-agent RL testbed based on Hanabi
LoCA
RLHive
academy
Ray tutorials from Anyscale
Artifical-Stock-Investment
atari-representation-learning
Code for "Unsupervised State Representation Learning in Atari"
autoregressive-tree
covid_p2p_simulation
Simulator for COVID-19 spread
ECE657A_Data-and-Knowledge-Modeling-and-Analysis-Assignments
Data and Knowledge Modeling and Analysis Assignments, Winter 2018
DEUP
Code for experiments to learn uncertainty
hnekoeiq's Repositories
hnekoeiq/autoregressive-tree
hnekoeiq/covid_p2p_simulation
Simulator for COVID-19 spread
hnekoeiq/academy
Ray tutorials from Anyscale
hnekoeiq/Artifical-Stock-Investment
hnekoeiq/atari-representation-learning
Code for "Unsupervised State Representation Learning in Atari"
hnekoeiq/ECE657A_Data-and-Knowledge-Modeling-and-Analysis-Assignments
Data and Knowledge Modeling and Analysis Assignments, Winter 2018
hnekoeiq/botorch
Bayesian optimization in PyTorch
hnekoeiq/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
hnekoeiq/dreamerv2
Mastering Atari with Discrete World Models
hnekoeiq/emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
hnekoeiq/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
hnekoeiq/Folder-Structure-Conventions
Folder / directory structure options and naming conventions for software projects
hnekoeiq/hanabi-learning-environment
hanabi_learning_environment is a research platform for Hanabi experiments.
hnekoeiq/hnekoeiq.github.io
hnekoeiq/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
hnekoeiq/mctx
Monte Carlo tree search in JAX
hnekoeiq/muzero-pytorch
Pytorch Implementation of MuZero
hnekoeiq/PatchUp
hnekoeiq/plan2explore
Repository for the paper "Planning to Explore via Self-Supervised World Models"
hnekoeiq/Recurrent-Deep-Q-Learning
Solving POMDP using Recurrent networks
hnekoeiq/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
hnekoeiq/RL-Adventure
Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
hnekoeiq/rlpyt
Reinforcement Learning in PyTorch
hnekoeiq/Transfer-Learning
hnekoeiq/wiseodd.github.io
wiseodd's blog