offline-reinforcement-learning

There are 82 repositories under the offline-reinforcement-learning topic.

  • tinkoff-ai/CORL

    High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

    Language: Python
  • ikostrikov/jaxrl

    JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

    Language: Jupyter Notebook
  • yihaosun1124/OfflineRL-Kit

    An elegant PyTorch offline reinforcement learning library for researchers.

    Language: Python
  • Allenpandas/Reinforcement-Learning-Papers

    📚 List of top-tier conference papers on Reinforcement Learning (RL), including NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.

  • Cryolite/kanachan

    A Japanese (Riichi) Mahjong AI Framework

    Language: Python
  • nikhilbarhate99/min-decision-transformer

    Minimal PyTorch implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling, for MuJoCo control tasks in OpenAI Gym

    Language: Python
  • instadeepai/og-marl

    Datasets with baselines for Offline MARL.

    Language: Python
  • polixir/OfflineRL

    A collection of offline reinforcement learning algorithms.

    Language: Python
  • nissymori/JAX-CORL

    Clean single-file implementation of offline RL algorithms in JAX

    Language: Python
  • BY571/CQL

    PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.

    Language: Python
  • polixir/NeoRL

    Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets

    Language: Python
  • ZhengyaoJiang/latentplan

    Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.

    Language: Python
  • ZhengYinan-AIR/FISOR

    [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"

    Language: Python
  • snu-mllab/EDAC

    Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)

    Language: Python
  • DHDev0/Stochastic-muzero

    PyTorch implementation of Stochastic MuZero for gym environments. The algorithm supports a wide range of action and observation spaces, both discrete and continuous.

    Language: Python
  • ryanxhr/POR

    [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"

    Language: Python
  • tinkoff-ai/ReBRAC

    Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC

    Language: Jupyter Notebook
  • tinkoff-ai/sac-rnd

    Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023

    Language: Python
  • Howuhh/sac-n-jax

    Single-file SAC-N implementation in JAX with Flax and Equinox. 10x faster than PyTorch

    Language: Python
  • LanqingLi1993/FOCAL-ICLR

    Code for the FOCAL paper published at ICLR 2021

    Language: Python
  • snu-mllab/DPPO

    Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)

    Language: Python
  • ryanxhr/DWBC

    [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"

    Language: Python
  • ZhengYinan-AIR/OMIGA

    [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization"

    Language: Python
  • holarissun/RewardShifting

    Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL

    Language: Python
  • LoopMind-AI/loopquest

    A Production Tool for Embodied AI

    Language: Python
  • sail-sg/rosmo

    Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR 2023

    Language: Python
  • BayesBrain/Habi

    Official PyTorch Implementation of Habitizing Diffusion Planning for Efficient and Effective Decision Making

    Language: Python
  • YangRui2015/AWGCSL

    Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.

    Language: Python
  • kschweig/OfflineRL

    Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning

    Language: Jupyter Notebook
  • ltlhuuu/A2PR

    [ICML 2024] The official PyTorch implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage-guided policy regularization method

    Language: Python
  • xionghuichen/MAPLE

    The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)

    Language: Python
  • yudasong/HyQ

    Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.

    Language: Python
  • zaiyan-x/RFQI

    Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]

    Language: Python
  • ltlhuuu/PSEC

    [ICLR 2025] The official implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a framework designed to facilitate efficient and flexible skill expansion and composition, iteratively evolve agents' capabilities, and efficiently address new challenges

    Language: Python
  • Manchery/iql-pytorch

    Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL

    Language: Python
  • DesikRengarajan/FEDORA

    [NeurIPS 2024] Code for Federated Ensemble-Directed Offline Reinforcement Learning

    Language: Python