Pinned Repositories
lambda_discrepancy
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
TextWorld
TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
airbnb-helper
Project for a datadive using the Airbnb dataset
aux-inputs
Reinforcement learning with auxiliary inputs
generalization-rl
Code for our CMPUT 607 project, based on the paper "Protecting Against Evaluation Overfitting in Empirical Reinforcement Learning"
knowyourvc-chrome-extension
The Know Your VC Chrome Extension!
lstm-contextual-decomposition
Reproducing "Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs"
nsrs
Code for the paper "Novelty Search in Representational Space for Sample Efficient Exploration", presented at NeurIPS 2020.
Pytorch-DQN
PyTorch DQN implementation to play Breakout
Pytorch-PolicyGradient
PyTorch implementation of policy gradients
taodav's Repositories
taodav/nsrs
Code for the paper "Novelty Search in Representational Space for Sample Efficient Exploration", presented at NeurIPS 2020.
taodav/aux-inputs
Reinforcement learning with auxiliary inputs
taodav/generalization-rl
Code for our CMPUT 607 project, based on the paper "Protecting Against Evaluation Overfitting in Empirical Reinforcement Learning"
taodav/TextWorldACG
Scripts for generating the TextWorldACG dataset (https://arxiv.org/abs/1812.00855)
taodav/balloon-learning-environment
The Balloon Learning Environment - flying stratospheric balloons with deep reinforcement learning.
taodav/bandits
Just some stuff on bandits
taodav/COMP421-D3-Q2
stuff
taodav/cs-useful-things
Useful things that I've accumulated as an undergrad/grad student studying Computer Science.
taodav/dreamerv3
Mastering Diverse Domains through World Models
taodav/grl
taodav/jaxrenderer
Differentiable Rasteriser implemented in JAX. Reference: https://github.com/erwincoumans/tinyrenderer, https://github.com/ssloy/tinyrenderer/wiki; PR: https://github.com/google/brax/pull/367
taodav/jelly-bean-world
A framework for experimenting with never-ending learning
taodav/jumanji
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
taodav/kobuddy
Kobo database backup and parser: extracts notes, highlights, reading progress and more
taodav/mc
Minecraft server for the frens
taodav/MCTS
Monte Carlo Tree Search for Q-value approximation
taodav/meta-learning
Implementations of meta-learning algorithms in TensorFlow. For use in one-shot facial recognition.
taodav/MuZero
A structured implementation of MuZero
taodav/onager
Lightweight python library for launching experiments and tuning hyperparameters, either locally or on a cluster
taodav/personal-site
My personal website - built with React, React-Router, React-Snap for Static-Export, and GitHub Pages.
taodav/pomdp-py
A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/
taodav/qmk_firmware
Open-source keyboard firmware for Atmel AVR and Arm USB families
taodav/Rainbow
Rainbow: Combining Improvements in Deep Reinforcement Learning
taodav/rewardpredictive
taodav/rl-competition
Repository for the 2009 RL Competition codebase
taodav/RL-Coursera
Implementations for the Coursera Reinforcement Learning Specialization
taodav/rlpyt
Reinforcement Learning in PyTorch
taodav/slack-bixi-bot
A small Slack bot to check the status of a given Bixi station
taodav/stuff
taodav/taodav.github.io
My Personal Website