Kavka1's Stars
myscience/open-genie
Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).
ml-jku/helm
hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
apexrl/Diff4RLSurvey
This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"
zhaoyi11/tcrl
frt03/mxt_bench
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)
Dahoas/reward-modeling
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
123penny123/Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
microsoft/RLHF-APA
RL algorithm: Advantage induced policy alignment
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
younggyoseo/MV-MWM
stepjam/ARM
Q-attention (within the ARM system) and coarse-to-fine Q-attention (within C2F-ARM system).
penn-pal-lab/LIV
Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)
suraj-nair-1/lorel
facebookresearch/vip
Official repository for "VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training"
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
stepjam/RLBench
A large-scale benchmark and learning environment.
vikashplus/robohive
A unified framework for robot learning
HybridRobotics/GenLoco
awarelab/continual_world
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
martius-lab/GateL0RD
Code for our NeurIPS 2021 paper: Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains
RajGhugare19/dreamerv2
Pytorch implementation of Dreamer-v2: Visual Model Based RL Algorithm.
RajGhugare19/alm
Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective
nicklashansen/tdmpc
Code for "Temporal Difference Learning for Model Predictive Control"
jmcoholich/isaacgym
facebookresearch/mbrl-lib
Library for Model Based RL
jsikyoon/dreamer-torch
Pytorch version of Dreamer, which follows the original TF v2 codes.