hnekoeiq

Interested in Deep/Reinforcement Learning, Game Theory and Information Theory.

@mila-iqiaMontreal

Pinned Repositories

Lifelong-Hanabi
A Continual Multi-agent RL testbed based on Hanabi
Language:Jupyter Notebook30 4 23
LoCA
Language:Python6 4 31
RLHive
Language:Python100 9 969
academy
Ray tutorials from Anyscale
Language:Jupyter Notebook0 0 00
Artifical-Stock-Investment
0 1 00
atari-representation-learning
Code for "Unsupervised State Representation Learning in Atari"
Language:Python0 0 00
autoregressive-tree
Language:HTML4 0 02
covid_p2p_simulation
Simulator for COVID-19 spread
Language:Python1 0 018
ECE657A_Data-and-Knowledge-Modeling-and-Analysis-Assignments
Data and Knowledge Modeling and Analysis Assignments, Winter 2018
Language:Jupyter Notebook0 0 00
DEUP
Code for experiments to learn uncertainty
Language:Jupyter Notebook30 4 16

hnekoeiq's Repositories

hnekoeiq/autoregressive-tree
Language:HTML4 0 02
hnekoeiq/covid_p2p_simulation
Simulator for COVID-19 spread
Language:Python1 0 018
hnekoeiq/academy
Ray tutorials from Anyscale
Language:Jupyter Notebook0 0 00
hnekoeiq/Artifical-Stock-Investment
0 1 00
hnekoeiq/atari-representation-learning
Code for "Unsupervised State Representation Learning in Atari"
Language:Python0 0 00
hnekoeiq/ECE657A_Data-and-Knowledge-Modeling-and-Analysis-Assignments
Data and Knowledge Modeling and Analysis Assignments, Winter 2018
Language:Jupyter Notebook0 0 00
hnekoeiq/botorch
Bayesian optimization in PyTorch
Language:Python0 0
hnekoeiq/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Language:Jupyter Notebook0 0
hnekoeiq/dreamerv2
Mastering Atari with Discrete World Models
Language:Python0 0
hnekoeiq/emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
Language:Python0 0
hnekoeiq/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
hnekoeiq/Folder-Structure-Conventions
Folder / directory structure options and naming conventions for software projects
0 0
hnekoeiq/hanabi-learning-environment
hanabi_learning_environment is a research platform for Hanabi experiments.
Language:Python0 0
hnekoeiq/hnekoeiq.github.io
Language:CSS1 0
hnekoeiq/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python0 0
hnekoeiq/mctx
Monte Carlo tree search in JAX
Language:Python0 0
hnekoeiq/muzero-pytorch
Pytorch Implementation of MuZero
Language:Python0 0
hnekoeiq/PatchUp
Language:Python0 0
hnekoeiq/plan2explore
Repository for the paper "Planning to Explore via Self-Supervised World Models"
Language:Python0 0
hnekoeiq/Recurrent-Deep-Q-Learning
Solving POMDP using Recurrent networks
Language:Jupyter Notebook0 0
hnekoeiq/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Language:Jupyter Notebook0 0
hnekoeiq/RL-Adventure
Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
Language:Jupyter Notebook0 0
hnekoeiq/rlpyt
Reinforcement Learning in PyTorch
Language:Python0 0
hnekoeiq/Transfer-Learning
Language:Python0 0
hnekoeiq/wiseodd.github.io
wiseodd's blog

hnekoeiq

Pinned Repositories

Lifelong-Hanabi

LoCA

RLHive

academy

Artifical-Stock-Investment

atari-representation-learning

autoregressive-tree

covid_p2p_simulation

ECE657A_Data-and-Knowledge-Modeling-and-Analysis-Assignments

DEUP

hnekoeiq's Repositories

hnekoeiq/autoregressive-tree

hnekoeiq/covid_p2p_simulation

hnekoeiq/academy

hnekoeiq/Artifical-Stock-Investment

hnekoeiq/atari-representation-learning

hnekoeiq/ECE657A_Data-and-Knowledge-Modeling-and-Analysis-Assignments

hnekoeiq/botorch

hnekoeiq/dopamine

hnekoeiq/dreamerv2

hnekoeiq/emdp

hnekoeiq/epymarl

hnekoeiq/Folder-Structure-Conventions

hnekoeiq/hanabi-learning-environment

hnekoeiq/hnekoeiq.github.io

hnekoeiq/maddpg

hnekoeiq/mctx

hnekoeiq/muzero-pytorch

hnekoeiq/PatchUp

hnekoeiq/plan2explore

hnekoeiq/Recurrent-Deep-Q-Learning

hnekoeiq/reinforcement-learning

hnekoeiq/RL-Adventure

hnekoeiq/rlpyt

hnekoeiq/Transfer-Learning

hnekoeiq/wiseodd.github.io