Baichenjia

Reinforcement Learning

TeleAI, China Telecom

Pinned Repositories

Contrastive-UCB
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Language: Python
DB
Dynamic Bottleneck for Robust Self-Supervised Exploration
Language: Python
GHER
G-HER algorithm
Language: Python
MINE
Mutual Information Neural Estimation (PyTorch)
Language: Jupyter Notebook
OB2I
Code for "Principled Exploration via Optimistic Bootstrapping and Backward Induction"
Language: Python
PBRL
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Language: Python
Pix2Pix-eager
TensorFlow eager implementation of Pix2Pix (image-to-image translation with conditional adversarial networks)
Language: Python
Tensorflow-TCN
TensorFlow eager implementation of Temporal Convolutional Network (TCN)
Language: Python
UTDS
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL
Language: Python
VDM
Code for "Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning"
Language: Python

Baichenjia's Repositories

Baichenjia/PBRL
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Language: Python
Baichenjia/UTDS
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL
Language: Python
Baichenjia/Contrastive-UCB
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Language: Python
Baichenjia/OB2I
Code for "Principled Exploration via Optimistic Bootstrapping and Backward Induction"
Language: Python
Baichenjia/DB
Dynamic Bottleneck for Robust Self-Supervised Exploration
Language: Python
Baichenjia/MQN-offline
Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning
Language: Python
Baichenjia/VDM
Code for "Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning"
Language: Python
Baichenjia/CeSD
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Language: Python
Baichenjia/BeCL-MI-Entropy
Mutual Information and Entropy estimation of BeCL and baseline methods
Language: Jupyter Notebook
Baichenjia/Embodied-Survey
Figures for Embodied Survey
Baichenjia/offline_safe_rl
Language: Python
Baichenjia/probabilistic-ensemble
Language: Python
Baichenjia/baichenjia.github.io
GitHub Pages
Language: HTML
Baichenjia/BeCL
BeCL: Behavior Contrastive Learning for Unsupervised Skill Discovery.
Language: Python
Baichenjia/CAVE_NoisyMinist
Language: Python
Baichenjia/cliport
CLIPort: What and Where Pathways for Robotic Manipulation
Language: Jupyter Notebook
Baichenjia/CODAC
Language: Python
Baichenjia/CVAE_exploration
Language: Python
Baichenjia/EBU
Reproduction of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update" (NeurIPS 2019) in TensorFlow
Language: Python
Baichenjia/humanplus
HumanPlus: Humanoid Shadowing and Imitation from Humans
Language: Python
Baichenjia/LearningHumanoidWalking
Training a humanoid robot for locomotion using Reinforcement Learning
Language: Python
Baichenjia/O-RAAC
Offline Risk-Averse Actor-Critic (O-RAAC): a model-free algorithm for risk-averse RL in a fully offline setting
Language: Python
Baichenjia/offline-rl-neurips.github.io
Language: HTML
Baichenjia/parkour
[CoRL 2023] Robot Parkour Learning
Language: Python
Baichenjia/Rebuttal-OEB3
Rebuttal of OEB3
Baichenjia/SELM
SELM
Baichenjia/SunRise-Constrain
SunRise-Constrain
Language: Python
Baichenjia/tdmpc
Code for "Temporal Difference Learning for Model Predictive Control"
Language: Python
Baichenjia/Temporary_D3IL
Language: Python
Baichenjia/walk-these-ways
Sim-to-real RL training and deployment tools for the Unitree Go1 robot.
Language: Python