Logan-lxw's Stars
JasonMa2016/SMODICE
Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML 2022)
KAIST-AILab/imitation-dice
jhejna/cpl
Code for Contrastive Preference Learning (CPL)
dmksjfl/SEABO
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
ott-jax/ott
Optimal transport tools implemented with the JAX framework, to get differentiable, parallel and jit-able computations.
ethanluoyc/optimal_transport_reward
polixir/morec
kaixindelele/ChatPaper
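Use ChatGPT to summarize arXiv papers. Accelerates the full research workflow, using ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.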
Zzl35/flow-to-better
yihaosun1124/OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
FelipeNuti/diffusion-relative-rewards
Codebase for Extracting Reward Functions from Diffusion Models
HxLyn3/Diaster
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward
Elegycloud/clash-for-linux-backup
A Clash for Linux backup repository based on Clash Core
csmile-1006/PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR 2023)
G0K0URURI/CROP
Code for paper "CROP: Conservative Reward for Model-based Offline Policy Optimization".
junming-yang/mopo
Model-based Offline Policy Optimization, re-implemented entirely in PyTorch
aimhubio/aim
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
ZhengyaoJiang/latentplan
Code release for Efficient Planning in a Compact Latent Action Space (ICLR 2023) https://arxiv.org/abs/2208.10291
ChenDRAG/SfBC
Code accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548
Zhendong-Wang/Diffusion-Policies-for-Offline-RL
seohongpark/HIQL
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
jannerm/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
gckor/Guider
google-deepmind/acme
A library of reinforcement learning components and agents
ademiadeniji/irm
Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
ldcq/ldcq
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
liuzuxin/DSRL
🔥 Datasets and env wrappers for offline safe reinforcement learning
liuzuxin/FSRL
🚀 A fast safe reinforcement learning library in PyTorch