Pinned Repositories
causal-rl
CRL by using Observational and interventional data
cid-in-rl
Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Martius, G., NeurIPS 2021
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
DEAR
Disentangled gEnerative cAusal Representation (DEAR)
GRADER
This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning"
Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
LLM4RL
A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
reasoning-teacher
[ACL 2023] Large Language Models Are Reasoning Teachers
CRLqinliang's Repositories
CRLqinliang/causal-rl
CRL by using Observational and interventional data
CRLqinliang/cid-in-rl
Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Martius, G., NeurIPS 2021
CRLqinliang/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
CRLqinliang/DEAR
Disentangled gEnerative cAusal Representation (DEAR)
CRLqinliang/GRADER
This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning"
CRLqinliang/Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
CRLqinliang/LLM4RL
A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
CRLqinliang/ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
CRLqinliang/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
CRLqinliang/reasoning-teacher
[ACL 2023] Large Language Models Are Reasoning Teachers
CRLqinliang/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
CRLqinliang/RL-DeepMind-stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
CRLqinliang/sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
CRLqinliang/SAC-Q
PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)
CRLqinliang/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
CRLqinliang/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
CRLqinliang/Test
CRLqinliang/wrs
The WRS Robot Planning & Control System