Kavka1

Fudan UniversityShanghai

Kavka1's Stars

myscience/open-genie
Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).
Language:Python14015
ml-jku/helm
Language:Python534
hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
96788
apexrl/Diff4RLSurvey
This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"
52724
zhaoyi11/tcrl
Language:Jupyter Notebook233
frt03/mxt_bench
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)
Language:Python134
Dahoas/reward-modeling
Language:Python9615
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Language:Python7.8k676
123penny123/Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
36220
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
3.5k278
microsoft/RLHF-APA
RL algorithm: Advantage induced policy alignment
Language:Python657
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language:Python4.6k478
younggyoseo/MV-MWM
Language:Python563
stepjam/ARM
Q-attention (within the ARM system) and coarse-to-fine Q-attention (within C2F-ARM system).
Language:Python17431
penn-pal-lab/LIV
Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)
Language:Python1038
suraj-nair-1/lorel
Language:Python386
facebookresearch/vip
Official repository for "VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training"
Language:Python15019
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
Language:Python83969
stepjam/RLBench
A large-scale benchmark and learning environment.
Language:Python1.3k264
vikashplus/robohive
A unified framework for robot learning
Language:Python55587
HybridRobotics/GenLoco
Language:Python25429
awarelab/continual_world
Language:Python9017
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
Language:Python1.4k291
martius-lab/GateL0RD
Code for our NeurIPS 2021 paper: Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains
Language:Python223
RajGhugare19/dreamerv2
Pytorch implementation of Dreamer-v2: Visual Model Based RL Algorithm.
Language:Python25343
RajGhugare19/alm
Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective
Language:Python797
nicklashansen/tdmpc
Code for "Temporal Difference Learning for Model Predictive Control"
Language:Python40961
jmcoholich/isaacgym
Language:Python253
facebookresearch/mbrl-lib
Library for Model Based RL
Language:Python988159
jsikyoon/dreamer-torch
Pytorch version of Dreamer, which follows the original TF v2 codes.
Language:Python12225

Kavka1

Kavka1's Stars

myscience/open-genie

ml-jku/helm

hanjuku-kaso/awesome-offline-rl

apexrl/Diff4RLSurvey

zhaoyi11/tcrl

frt03/mxt_bench

Dahoas/reward-modeling

lucidrains/PaLM-rlhf-pytorch

123penny123/Awesome-LLM-RL

GT-RIPL/Awesome-LLM-Robotics

microsoft/RLHF-APA

CarperAI/trlx

younggyoseo/MV-MWM

stepjam/ARM

penn-pal-lab/LIV

suraj-nair-1/lorel

facebookresearch/vip

luchris429/purejaxrl

stepjam/RLBench

vikashplus/robohive

HybridRobotics/GenLoco

awarelab/continual_world

Farama-Foundation/D4RL

martius-lab/GateL0RD

RajGhugare19/dreamerv2

RajGhugare19/alm

nicklashansen/tdmpc

jmcoholich/isaacgym

facebookresearch/mbrl-lib

jsikyoon/dreamer-torch