Pinned Repositories
Contrastive-UCB
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
DB
Dynamic Bottleneck for Robust Self-Supervised Exploration
GHER
G-HER algorithm
MINE
Mutual Information Neural Estimation (Pytorch)
OB2I
Code for "Principled Exploration via Optimistic Bootstrapping and Backward Induction"
PBRL
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Pix2Pix-eager
Tensorflow eager implementation of Pix2Pix (Image-to-image translation with conditional adversarial networks)
Tensorflow-TCN
Tensorflow eager implementation of Temporal Convolutional Network (TCN)
UTDS
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL
VDM
Code for "Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning"
Baichenjia's Repositories
Baichenjia/PBRL
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Baichenjia/UTDS
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL
Baichenjia/Contrastive-UCB
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Baichenjia/OB2I
Code for "Principled Exploration via Optimistic Bootstrapping and Backward Induction"
Baichenjia/DB
Dynamic Bottleneck for Robust Self-Supervised Exploration
Baichenjia/MQN-offline
Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning
Baichenjia/VDM
Code for "Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning"
Baichenjia/CeSD
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Baichenjia/BeCL-MI-Entropy
Mutual Information and Entropy estimation of BeCL and baseline methods
Baichenjia/Embodied-Survey
Figures for Embodied Survey
Baichenjia/offline_safe_rl
Baichenjia/probabilistic-ensemble
Baichenjia/baichenjia.github.io
github pages
Baichenjia/BeCL
BeCL: Behavior Contrastive Learning for Unsupervised Skill Discovery.
Baichenjia/CAVE_NoisyMinist
Baichenjia/cliport
CLIPort: What and Where Pathways for Robotic Manipulation
Baichenjia/CODAC
Baichenjia/CVAE_exploration
Baichenjia/EBU
Reproduce for "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update" (NeurIPS 2019) with Tensorflow
Baichenjia/humanplus
HumanPlus: Humanoid Shadowing and Imitation from Humans
Baichenjia/LearningHumanoidWalking
Training a humanoid robot for locomotion using Reinforcement Learning
Baichenjia/O-RAAC
Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting
Baichenjia/offline-rl-neurips.github.io
Baichenjia/parkour
[CoRL 2023] Robot Parkour Learning
Baichenjia/Rebuttal-OEB3
Rebuttal of OEB3
Baichenjia/SELM
SELM
Baichenjia/SunRise-Constrain
SunRise-Constrain
Baichenjia/tdmpc
Code for "Temporal Difference Learning for Model Predictive Control"
Baichenjia/Temporary_D3IL
Baichenjia/walk-these-ways
Sim-to-real RL training and deployment tools for the Unitree Go1 robot.