CRLqinliang

This is a good beginning.

Pinned Repositories

causal-rl
CRL using observational and interventional data
Language: Python
cid-in-rl
Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Martius, G., NeurIPS 2021
Language: Python
cleanrl
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language: Python
DEAR
Disentangled gEnerative cAusal Representation (DEAR)
Language: Python
GRADER
This is the official implementation of the NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning"
Language: Python
Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
Language: Python
LLM4RL
An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
Language: Python
ppo-implementation-details
The source code for the blog post "The 37 Implementation Details of Proximal Policy Optimization"
Language: Python
PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
Language: Python
reasoning-teacher
[ACL 2023] Large Language Models Are Reasoning Teachers
Language: Jupyter Notebook

CRLqinliang's Repositories

CRLqinliang/causal-rl
CRL using observational and interventional data
Language: Python
CRLqinliang/cid-in-rl
Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Martius, G., NeurIPS 2021
Language: Python
CRLqinliang/cleanrl
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language: Python
CRLqinliang/DEAR
Disentangled gEnerative cAusal Representation (DEAR)
Language: Python
CRLqinliang/GRADER
This is the official implementation of the NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning"
Language: Python
CRLqinliang/Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
Language: Python
CRLqinliang/LLM4RL
An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
Language: Python
CRLqinliang/ppo-implementation-details
The source code for the blog post "The 37 Implementation Details of Proximal Policy Optimization"
Language: Python
CRLqinliang/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
Language: Python
CRLqinliang/reasoning-teacher
[ACL 2023] Large Language Models Are Reasoning Teachers
Language: Jupyter Notebook
CRLqinliang/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
CRLqinliang/RL-DeepMind-stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
CRLqinliang/sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
CRLqinliang/SAC-Q
PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)
CRLqinliang/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
CRLqinliang/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language: Python
CRLqinliang/Test
CRLqinliang/wrs
The WRS Robot Planning & Control System
Language: Python
