Pinned Repositories
16831-spring-2021
App
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
briee
Representation Learning in RL
Canvas
DQfD
An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games
HyQ
Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.
PCMLP
policy_adapt
Code for paper Provably Efficient Model-based Policy Adaptation
Reinforcement-Learning-Branch-and-Bound
yudasong's Repositories
yudasong/HyQ
Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.
yudasong/briee
Representation Learning in RL
yudasong/Reinforcement-Learning-Branch-and-Bound
yudasong/policy_adapt
Code for paper Provably Efficient Model-based Policy Adaptation
yudasong/PCMLP
yudasong/DQfD
An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games
yudasong/16831-spring-2021
yudasong/App
yudasong/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
yudasong/Canvas
yudasong/DI-engine
OpenDILab Decision AI Engine
yudasong/dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
yudasong/DOC3-Project
yudasong/homework
Assignments for CS294-112.
yudasong/Instagram
yudasong/maml_rl
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
yudasong/models
Models and examples built with TensorFlow
yudasong/PCPG
yudasong/Photo-Map
yudasong/policy_transfer
yudasong/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
yudasong/pytorch_sac_ae
PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)
yudasong/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
yudasong/replication-mbpo
NeurIPS Reproducibility Challenge 2019
yudasong/slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
yudasong/TeamFormation
yudasong/temperature
yudasong/Tinder
yudasong/toolkit
yudasong/udacity_NN