taodav

Just here trying to tile my wall a nice shade of green

Brown UniversityProvidence

Pinned Repositories

lambda_discrepancy
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
Language:Python14 1 00
TextWorld
TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
Language:Jupyter Notebook1.2k 39 84188
airbnb-helper
Project for datadive using the Airbnb dataset
Language:Jupyter Notebook1 2 00
aux-inputs
reinforcement learning with auxiliary inputs
Language:Jupyter Notebook1 1 01
generalization-rl
Code for our CMPUT 607 project, based on the paper "Protecting Against Evaluation Overfitting in Empirical Reinforcement Learning"
Language:Python1 3 00
knowyourvc-chrome-extension
The Know Your VC Chrome Extension!
Language:JavaScript5 1 00
lstm-contextual-decomposition
Reproducing "Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs"
Language:Jupyter Notebook5 2 01
nsrs
Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.
Language:Jupyter Notebook15 2 13
Pytorch-DQN
Pytorch DQN implementation to play Breakout
Language:Jupyter Notebook4 1 02
Pytorch-PolicyGradient
Pytorch implementation for Policy Gradients
Language:Jupyter Notebook2 1 00

taodav's Repositories

taodav/nsrs
Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.
Language:Jupyter Notebook15 2 13
taodav/aux-inputs
reinforcement learning with auxiliary inputs
Language:Jupyter Notebook1 1 01
taodav/generalization-rl
Code for our CMPUT 607 project, based on the paper "Protecting Against Evaluation Overfitting in Empirical Reinforcement Learning"
Language:Python1 3 00
taodav/TextWorldACG
Scripts for generating the TextWorldACG dataset (https://arxiv.org/abs/1812.00855)
Language:Python1 3 01
taodav/balloon-learning-environment
The Balloon Learning Environment - flying stratospheric balloons with deep reinforcement learning.
Language:Jupyter Notebook
taodav/bandits
Just some stuff on bandits
Language:Jupyter Notebook2 0
taodav/COMP421-D3-Q2
stuff
Language:Java
taodav/cs-useful-things
Useful things that I've accumulated as an undergrad/grad student studying Computer Science.
3 0
taodav/dreamerv3
Mastering Diverse Domains through World Models
Language:Python0 0
taodav/grl
Language:Python0 01
taodav/jaxrenderer
Differentiable Rasteriser implemented in JAX. Reference: https://github.com/erwincoumans/tinyrenderer, https://github.com/ssloy/tinyrenderer/wiki; PR: https://github.com/google/brax/pull/367
taodav/jelly-bean-world
A framework for experimenting with never-ending learning
Language:C++1 0
taodav/jumanji
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
Language:Python0 0
taodav/kobuddy
Kobo database backup and parser: extracts notes, highlights, reading progress and more
Language:Python0 0
taodav/mc
minecraft server for the frens
Language:Shell
taodav/MCTS
Monte Carlo Tree Search for Q-value approximation
Language:Python1 0
taodav/meta-learning
Implementations of meta-learning algorithms in TensorFlow. For use in one-shot facial recognition.
Language:Python
taodav/MuZero
A structured implementation of MuZero
Language:Python2 0
taodav/onager
Lightweight python library for launching experiments and tuning hyperparameters, either locally or on a cluster
Language:Python0 0
taodav/personal-site
My personal website - built with React, React-Router, React-Snap for Static-Export, and GitHub Pages.
Language:SCSS1 0
taodav/pomdp-py
A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/
Language:Python0 0
taodav/qmk_firmware
Open-source keyboard firmware for Atmel AVR and Arm USB families
Language:C1 0
taodav/Rainbow
Rainbow: Combining Improvements in Deep Reinforcement Learning
Language:Python
taodav/rewardpredictive
Language:Jupyter Notebook1 01
taodav/rl-competition
Repository for the 2009 RL Competition codebase
Language:Java4 0
taodav/RL-Coursera
Implementations of Coursera Reinforcement Learning Specialization
Language:Jupyter Notebook0 0
taodav/rlpyt
Reinforcement Learning in PyTorch
Language:Python1 0
taodav/slack-bixi-bot
A small slack bot to check the status of a given bixi status
Language:Python
taodav/stuff
Language:Shell2 0
taodav/taodav.github.io
My Personal Website
Language:HTML