Logan-lxw

Logan-lxw's Stars

JasonMa2016/SMODICE
Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML 2022)
Language:Python253
KAIST-AILab/imitation-dice
Language:Python176
jhejna/cpl
Code for Contrastive Preference Learning (CPL)
Language:Python15213
dmksjfl/SEABO
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
Language:Python10
ott-jax/ott
Optimal transport tools implemented with the JAX framework, to get differentiable, parallel and jit-able computations.
Language:Python52080
ethanluoyc/optimal_transport_reward
Language:Python134
polixir/morec
Language:Python72
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Language:Python18.4k1.9k
Zzl35/flow-to-better
Language:Python171
yihaosun1124/OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
Language:Python27433
FelipeNuti/diffusion-relative-rewards
Codebase for Extracting Reward Functions from Diffusion Models
Language:Python123
HxLyn3/Diaster
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward
Language:Python5
Elegycloud/clash-for-linux-backup
基于Clash Core 制作的Clash For Linux备份仓库 A Clash For Linux Backup Warehouse Based on Clash Core
Language:Shell2.4k985
csmile-1006/PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
Language:Python15017
G0K0URURI/CROP
Code for paper "CROP: Conservative Reward for Model-based Offline Policy Optimization".
Language:Python7
junming-yang/mopo
Model-based Offline Policy Optimization re-implement all by pytorch
Language:Python276
aimhubio/aim
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
Language:Python5.2k320
ZhengyaoJiang/latentplan
Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.
Language:Python9411
ChenDRAG/SfBC
Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548
Language:Python425
Zhendong-Wang/Diffusion-Policies-for-Offline-RL
Language:Python26636
seohongpark/HIQL
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
Language:Python767
jannerm/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
Language:Python868135
gckor/Guider
Language:Python1
google-deepmind/acme
A library of reinforcement learning components and agents
Language:Python3.5k426
ademiadeniji/irm
Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)
Language:Python437
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
Language:Python1.3k284
ldcq/ldcq
Language:Python263
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Language:Python931132
liuzuxin/DSRL
🔥 Datasets and env wrappers for offline safe reinforcement learning
Language:Python724
liuzuxin/FSRL
🚀 A fast safe reinforcement learning library in PyTorch
Language:Python16026