yudasong

PhD at MLD, CMU.

CMU

Pinned Repositories

16831-spring-2021
Language:TeX0 1 00
App
Language:Swift0 2 00
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python0 2 01
briee
Representation Learning in RL
Language:Python16 2 11
Canvas
Language:Swift0 2 00
DQfD
An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games
Language:Python1 0 01
HyQ
Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.
Language:Python24 1 23
PCMLP
Language:Python3 2 04
policy_adapt
Code for paper Provably Efficient Model-based Policy Adaptation
Language:Python7 1 00
Reinforcement-Learning-Branch-and-Bound
Language:Python16 5 07

yudasong's Repositories

yudasong/HyQ
Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.
Language:Python24 1 23
yudasong/briee
Representation Learning in RL
Language:Python16 2 11
yudasong/Reinforcement-Learning-Branch-and-Bound
Language:Python16 5 07
yudasong/policy_adapt
Code for paper Provably Efficient Model-based Policy Adaptation
Language:Python7 1 00
yudasong/PCMLP
Language:Python3 2 04
yudasong/DQfD
An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games
Language:Python1 0 01
yudasong/16831-spring-2021
Language:TeX0 1 00
yudasong/App
Language:Swift0 2 00
yudasong/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python0 2 01
yudasong/Canvas
Language:Swift0 2 00
yudasong/DI-engine
OpenDILab Decision AI Engine
Language:Python0 0
yudasong/dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
Language:Python1 0
yudasong/DOC3-Project
Language:Swift2 0
yudasong/homework
Assignments for CS294-112.
Language:Python2 0
yudasong/Instagram
Language:Swift2 1
yudasong/maml_rl
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
Language:Python2 0
yudasong/models
Models and examples built with TensorFlow
Language:Python2 0
yudasong/PCPG
Language:Python1 0
yudasong/Photo-Map
Language:Swift2 0
yudasong/policy_transfer
Language:Python2 0
yudasong/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python3 0
yudasong/pytorch_sac_ae
PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)
Language:Python1 0
yudasong/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Language:Python2 0
yudasong/replication-mbpo
NeurIPS Reproducibility Challenge 2019
Language:Python1 0
yudasong/slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
Language:Python1 0
yudasong/TeamFormation
Language:Java3 0
yudasong/temperature
Language:Java2 0
yudasong/Tinder
Language:Swift2 0
yudasong/toolkit
Language:Python
yudasong/udacity_NN
Language:Jupyter Notebook2 0