offline-rl
There are 51 repositories under the offline-rl topic.
opendilab/DI-engine
OpenDILab Decision AI Engine: a comprehensive reinforcement learning framework.
takuseno/d3rlpy
An offline deep reinforcement learning library
hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
mbreuss/diffusion-literature-for-robotics
A summary of key papers and blog posts for learning about diffusion models, with a detailed list of published diffusion-for-robotics papers.
yingchengyang/Reinforcement-Learning-Papers
Related papers for reinforcement learning, including classic papers and the latest papers from top conferences.
Farama-Foundation/Minari
A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
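Whatever the on-disk format, offline RL algorithms ultimately consume batches of (observation, action, reward, next observation, done) transitions. The sketch below is illustrative only — Minari's actual API is episode-based and differs — but it shows the standard transition layout and random-batch sampling that offline algorithms build on:

```python
import numpy as np

class TransitionDataset:
    """Minimal container for offline RL transitions.

    Illustrative only: Minari defines its own episode-based format;
    this sketch just shows the (s, a, r, s', done) layout that
    offline algorithms consume.
    """

    def __init__(self, obs, actions, rewards, next_obs, dones):
        self.obs = np.asarray(obs, dtype=np.float32)
        self.actions = np.asarray(actions)
        self.rewards = np.asarray(rewards, dtype=np.float32)
        self.next_obs = np.asarray(next_obs, dtype=np.float32)
        self.dones = np.asarray(dones, dtype=bool)

    def __len__(self):
        return len(self.rewards)

    def sample(self, batch_size, rng=None):
        """Draw a random minibatch of transitions (with replacement)."""
        rng = rng or np.random.default_rng(0)
        idx = rng.integers(0, len(self), size=batch_size)
        return (self.obs[idx], self.actions[idx], self.rewards[idx],
                self.next_obs[idx], self.dones[idx])
```

Because the data are fixed, sampling is the only interaction an offline algorithm has with the "environment"; everything else is supervised-style minibatch training over this buffer.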
opendilab/DI-engine-docs
DI-engine docs (Chinese and English)
Cryolite/kanachan
A Japanese (Riichi) Mahjong AI Framework
Sea-Snell/Implicit-Language-Q-Learning
Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"
liuzuxin/OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
Shanghai-Digital-Brain-Laboratory/BDM-DB1
A large-scale multi-modal pre-trained model
hakuhodo-technologies/scope-rl
SCOPE-RL: A Python library for offline reinforcement learning, off-policy evaluation, and selection.
denisyarats/exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
takuseno/minerva
An out-of-the-box GUI tool for offline deep reinforcement learning
nissymori/JAX-CORL
Clean single-file implementation of offline RL algorithms in JAX
opendilab/GenerativeRL
Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).
Div99/XQL
Extreme Q-Learning: Max Entropy RL without Entropy
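The "max entropy without entropy" idea in XQL rests on Gumbel regression: the loss exp(z) − z − 1 is non-negative, zero only at z = 0, and when minimized over dataset actions drives the regression target toward a soft maximum (log-sum-exp) of Q, with no explicit entropy term to sample. A hedged NumPy sketch (the z = (Q − V)/β parameterization is my reading of the method, not code from this repo):

```python
import numpy as np

def gumbel_loss(q_minus_v, beta=1.0):
    """Gumbel regression loss, sketched from the XQL idea.

    z = (Q(s, a) - V(s)) / beta.  l(z) = exp(z) - z - 1 is convex,
    non-negative, and zero only at z = 0; minimizing its mean over
    dataset actions pulls V(s) toward a soft-maximum of Q.
    """
    z = np.asarray(q_minus_v, dtype=np.float64) / beta
    return float(np.mean(np.exp(z) - z - 1.0))
```

The temperature β controls how sharply the implied soft-maximum approaches a hard max; in practice exp(z) must be clipped for numerical stability, which the sketch omits.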
nakamotoo/Cal-QL
Official implementation of the paper "Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning".
liuzuxin/DSRL
🔥 Datasets and env wrappers for offline safe reinforcement learning
LAMDA-RL/OfflineRL-Lib
Benchmarked implementations of offline RL algorithms.
callmespring/RL-short-course
Reinforcement Learning Short Course
MLforHealth/rl_representations
Learning representations for RL in Healthcare under a POMDP assumption
young-geng/JaxCQL
Conservative Q-Learning (CQL) in JAX.
BY571/Implicit-Q-Learning
PyTorch implementation of the implicit Q-learning algorithm (IQL)
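IQL's key ingredient is expectile regression: an asymmetric squared loss that lets the value function approximate an upper envelope of Q over dataset actions, so the algorithm never evaluates out-of-distribution actions. A small NumPy sketch (function and argument names here are illustrative, not taken from this repo):

```python
import numpy as np

def expectile_loss(diff, tau=0.7):
    """Asymmetric L2 loss used by IQL to fit the value function.

    diff = Q(s, a) - V(s).  With tau > 0.5, positive errors are
    weighted more heavily than negative ones, so the fitted V(s)
    approximates an upper expectile of Q over dataset actions.
    """
    weight = np.where(diff > 0, tau, 1.0 - tau)
    return float((weight * diff ** 2).mean())
```

At tau = 0.5 this reduces to plain mean-squared error; as tau approaches 1 the fitted value approaches the maximum of Q over actions seen in the data.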
holarissun/Prompt-OIRL
Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning".
sail-sg/rosmo
Code for "Efficient Offline Policy Optimization with a Learned Model" (ICLR 2023).
tinkoff-ai/eop
Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters" (ICML 2022).
XanderJC/medkit-learn
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation (NeurIPS 2021) by Alex J. Chan, Ioana Bica, Alihan Huyuk, Daniel Jarrett, and Mihaela van der Schaar.
junming-yang/mopo
A full PyTorch re-implementation of Model-based Offline Policy Optimization (MOPO).
hari-sikchi/offline_rl
PyTorch implementations of state-of-the-art offline reinforcement learning algorithms.
xionghuichen/MAPLE
Official code for Offline Model-based Adaptable Policy Learning (NeurIPS 2021 & TPAMI).
AIR-DI/D2C
D2C (Data-driven Control Library): a library for data-driven control based on reinforcement learning.
christopher-beckham/coms-are-energy-models
Official code for paper: Conservative objective models are a special kind of contrastive divergence-based energy model
YiqinYang/VEM
Code accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022, https://arxiv.org/abs/2110.09796).
samholt/NeuralLaplaceControl
Neural Laplace Control for Continuous-time Delayed Systems: an offline RL method that combines a Neural Laplace dynamics model with an MPC planner to reach near-expert policy performance in environments with irregular time intervals and an unknown constant delay.
amazon-science/cdc-batch-rl
Code for Continuous Doubly Constrained Batch Reinforcement Learning, NeurIPS 2021.