Pinned Repositories
AI-Monte-Carlo-Chess
async-rl
Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)
awesome-deep-reinforcement-learning
A collection of resources about deep reinforcement learning
awesome-point-cloud-processing
A curated list of awesome Point Cloud Processing Resources, Libraries, Software
gym-atari-human
A user interface for human play in OpenAI Gym Atari environments.
gym-unity3d
An OpenAI Gym interface for Unity3D
infer-policy-feature
simple-linear-classification
A Python implementation of linear classification algorithms (including Probabilistic Generative Model and Probabilistic Discriminative Model). See Pattern Recognition and Machine Learning, Bishop.
simple-linear-regression
A Python implementation of linear regression algorithms (including Maximum Likelihood, Maximum a Posteriori, and Bayesian).
tensorflow-action-conditional-video-prediction
A Python TensorFlow implementation of nips2015-action-conditional-video-prediction
williamd4112's Repositories
williamd4112/williamd4112.github.io
williamd4112/suboptimal_offline_datasets
williamd4112/6.866-HW5
williamd4112/ARS
An implementation of the Augmented Random Search algorithm
williamd4112/BCO
Behavior cloning from observation
williamd4112/BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
williamd4112/CityLearn
Official reinforcement learning environment for demand response and load shaping
williamd4112/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
williamd4112/curiosity_baselines
An open source reinforcement learning codebase with a variety of intrinsic curiosity methods implemented in PyTorch on top of rlpyt.
williamd4112/deep_control
Deep Reinforcement Learning for Continuous Control in PyTorch
williamd4112/Dr-Jekyll-and-Mr-Hyde-The-Strange-Case-of-Off-Policy-Policy-Updates
Code for Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates: https://arxiv.org/abs/2109.14727
williamd4112/dreamerv2
Mastering Atari with Discrete World Models
williamd4112/easy_experiments
An easy-to-modify tool for launching many experiments.
williamd4112/explore_establish_exploit_llms
williamd4112/fuzz4all
🌌️Fuzz4All: Universal Fuzzing with Large Language Models
williamd4112/gdown
Download a large file from Google Drive (curl/wget fails because of the security notice).
williamd4112/gym-maze
A customizable gym environment for maze/gridworld
williamd4112/jaynes
A package for running ML training on SLURM, AWS, GCE, and physical boxes with or without docker
williamd4112/jaynes-starter-kit
A starter kit for jaynes, the cloud-agnostic launch library
williamd4112/mrl
williamd4112/multiworld
Multitask Environments for RL
williamd4112/phi_gcn
williamd4112/py-ttc
Python implementation of the vision-based direct methods of time-to-contact (TTC) estimation
williamd4112/pythagora
Generate automated tests for your Node.js app via LLMs without developers having to write a single line of code.
williamd4112/PythonLinearNonlinearControl
PythonLinearNonLinearControl is a library implementing linear and nonlinear control theories in Python.
williamd4112/RL4LMs
A modular RL library to fine-tune language models to human preferences
williamd4112/robovat
RoboVat: A unified toolkit for simulated and real-world robotic task environments.
williamd4112/trl
Train transformer language models with reinforcement learning.
williamd4112/vizdoomgym
OpenAI Gym wrapper for ViZDoom environments
williamd4112/WNPG
Implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies