williamd4112

Pinned Repositories

AI-Monte-Carlo-Chess
Language:C++0 3 00
async-rl
Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)
Language:Python0 1 00
awesome-deep-reinforcement-learning
A collection of resources about deep reinforcement learning
24 4 06
awesome-point-cloud-processing
A curated list of awesome Point Cloud Processing Resources, Libraries, Software
0 1 01
gym-atari-human
A user interface for human in OpenAI gym atari.
Language:Python3 1 02
gym-unity3d
An OpenAI Gym interface for Unity3D
Language:C#8 6 12
infer-policy-feature
Language:Python8 3 04
simple-linear-classification
A python implementation of linear classification algorithm (including Probabilistic Generative Model, Probabilistic Discriminative Model). (See Pattern Recognition and Machine Learning, Bishop)
Language:Python7 2 02
simple-linear-regression
A python implementation of linear regression algorithm. (including Maximum Likelihood, Maximum a posterior, Bayesian)
Language:Python8 1 02
tensorflow-action-conditional-video-prediction
A python tensorflow implementation of nips2015-action-conditional-video-prediction
Language:Python12 2 14

williamd4112's Repositories

williamd4112/williamd4112.github.io
Language:HTML3 1 0
williamd4112/suboptimal_offline_datasets
Language:Python2 1 0
williamd4112/6.866-HW5
Language:Python1 0
williamd4112/ARS
An implementation of the Augmented Random Search algorithm
Language:Python0 0
williamd4112/BCO
behavior cloning from observation
Language:Jupyter Notebook0 0
williamd4112/BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
Language:Jupyter Notebook0 0
williamd4112/CityLearn
Official reinforcement learning environment for demand response and load shaping
Language:Python0 0
williamd4112/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python0 02
williamd4112/curiosity_baselines
An open source reinforcement learning codebase with a variety of intrinsic curiosity methods implemented in PyTorch on top of rlpyt.
Language:Python0 0
williamd4112/deep_control
Deep Reinforcement Learning for Continuous Control in PyTorch
Language:Python0 0
williamd4112/Dr-Jekyll-and-Mr-Hyde-The-Strange-Case-of-Off-Policy-Policy-Updates
Code for Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates: https://arxiv.org/abs/2109.14727
Language:Python0 0
williamd4112/dreamerv2
Mastering Atari with Discrete World Models
Language:Python0 0
williamd4112/easy_experiments
A easy-to-modify tool for launching many experiments.
Language:Python1 0
williamd4112/explore_establish_exploit_llms
Language:Python0 0
williamd4112/fuzz4all
🌌️Fuzz4All: Universal Fuzzing with Large Language Models
Language:Python0 0
williamd4112/gdown
Download a large file from Google Drive (curl/wget fails because of the security notice).
Language:Python0 0
williamd4112/gym-maze
A customizable gym environment for maze/gridworld
Language:Python2 0
williamd4112/jaynes
A package for running ML training on SLURM, AWS, GCE, and physical boxes with or without docker
Language:Python0 0
williamd4112/jaynes-starter-kit
a starter-kit for jaynes, the cloud-agnostic launch library
Language:Python0 0
williamd4112/mrl
Language:Python0 0
williamd4112/multiworld
Multitask Environments for RL
Language:Python0 0
williamd4112/phi_gcn
Language:Python0 0
williamd4112/py-ttc
Python implementation of the vision-based direct methods of time-to-contact (TTC) estimation
Language:Python1 0
williamd4112/pythagora
Generate automated tests for your Node.js app via LLMs without developers having to write a single line of code.
Language:JavaScript0 0
williamd4112/PythonLinearNonlinearControl
PythonLinearNonLinearControl is a library implementing the linear and nonlinear control theories in python.
Language:Python0 0
williamd4112/RL4LMs
A modular RL library to fine-tune language models to human preferences
Language:Python0 0
williamd4112/robovat
RoboVat: A unified toolkit for simulated and real-world robotic task environments.
Language:Python0 0
williamd4112/trl
Train transformer language models with reinforcement learning.
Language:Python0 0
williamd4112/vizdoomgym
OpenAI Gym wrapper for ViZDoom enviroments
Language:Python0 0
williamd4112/WNPG
implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies
Language:Python0 0