offline-reinforcement-learning

There are 82 repositories under offline-reinforcement-learning topic.

tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language:Python1.3k 17 28154
ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Language:Jupyter Notebook713 14 872
yihaosun1124/OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
Language:Python358 6 1136
Allenpandas/Reinforcement-Learning-Papers
📚 List of Top-tier Conference Papers on Reinforcement Learning (RL)，including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.
336 13 334
Cryolite/kanachan
A Japanese (Riichi) Mahjong AI Framework
Language:Python320 15 2840
nikhilbarhate99/min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym
Language:Python279 2 728
instadeepai/og-marl
Datasets with baselines for Offline MARL.
Language:Python178 7 1714
polixir/OfflineRL
A collection of offline reinforcement learning algorithms.
Language:Python175 4 1020
nissymori/JAX-CORL
Clean single-file implementation of offline RL algorithms in JAX
Language:Python154 4 254
BY571/CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.
Language:Python141 3 723
polixir/NeoRL
Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets
Language:Python117 5 1212
ZhengyaoJiang/latentplan
Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.
Language:Python105 2 212
ZhengYinan-AIR/FISOR
[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"
Language:Python94 3 77
snu-mllab/EDAC
Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)
Language:Python76 2 26
DHDev0/Stochastic-muzero
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
Language:Python71 4 1010
ryanxhr/POR
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
Language:Python58 2 27
tinkoff-ai/ReBRAC
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
Language:Jupyter Notebook55 2 06
tinkoff-ai/sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
Language:Python53 3 05
Howuhh/sac-n-jax
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
Language:Python52 1 13
LanqingLi1993/FOCAL-ICLR
Code for FOCAL Paper Published at ICLR 2021
Language:Python52 3 119
snu-mllab/DPPO
Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)
Language:Python42 2 21
ryanxhr/DWBC
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
Language:Python34 1 22
ZhengYinan-AIR/OMIGA
[NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization"
Language:Python33 2 83
holarissun/RewardShifting
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
Language:Python29 2 03
LoopMind-AI/loopquest
A Production Tool for Embodied AI
Language:Python29 2 21
sail-sg/rosmo
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
Language:Python29 6 30
BayesBrain/Habi
Official PyTorch Implementation of Habitizing Diffusion Planning for Efficient and Effective Decision Making
Language:Python27
YangRui2015/AWGCSL
Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.
Language:Python26 1 02
kschweig/OfflineRL
Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
Language:Jupyter Notebook25 1 16
ltlhuuu/A2PR
[ICML 2024] The offical implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage-guided policy regularization method, in Pytorch
Language:Python25 2 00
xionghuichen/MAPLE
The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)
Language:Python25 3 05
yudasong/HyQ
Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.
Language:Python24 1 23
zaiyan-x/RFQI
Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]
Language:Python24 2 33
ltlhuuu/PSEC
[ICLR 2025] The offical implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilitate efficient and flexible skill expansion and composition, iteratively evolve the agents' capabilities and efficiently address new challenges
Language:Python231
Manchery/iql-pytorch
Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL
Language:Python23 1 01
DesikRengarajan/FEDORA
[NeurIPS 2024] Code for Federated Ensemble-Directed Offline Reinforcement Learning
Language:Python22 1 04

offline-reinforcement-learning

tinkoff-ai/CORL

ikostrikov/jaxrl

yihaosun1124/OfflineRL-Kit

Allenpandas/Reinforcement-Learning-Papers

Cryolite/kanachan

nikhilbarhate99/min-decision-transformer

instadeepai/og-marl

polixir/OfflineRL

nissymori/JAX-CORL

BY571/CQL

polixir/NeoRL

ZhengyaoJiang/latentplan

ZhengYinan-AIR/FISOR

snu-mllab/EDAC

DHDev0/Stochastic-muzero

ryanxhr/POR

tinkoff-ai/ReBRAC

tinkoff-ai/sac-rnd

Howuhh/sac-n-jax

LanqingLi1993/FOCAL-ICLR

snu-mllab/DPPO

ryanxhr/DWBC

ZhengYinan-AIR/OMIGA

holarissun/RewardShifting

LoopMind-AI/loopquest

sail-sg/rosmo

BayesBrain/Habi

YangRui2015/AWGCSL

kschweig/OfflineRL

ltlhuuu/A2PR

xionghuichen/MAPLE

yudasong/HyQ

zaiyan-x/RFQI

ltlhuuu/PSEC

Manchery/iql-pytorch

DesikRengarajan/FEDORA