decision-transformers
There are 18 repositories under decision-transformers topic.
opendilab/awesome-decision-transformer
A curated list of Decision Transformer resources (continually updated)
tomekkorbak/pretraining-with-human-feedback
Code accompanying the paper Pretraining Language Models with Human Preferences
elicassion/StARformer
[ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.
maohangyu/TIT_open_source
The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"
ml-jku/L2M
Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)
etaoxing/multigame-dt
Implementation of Multi-Game Decision Transformers in PyTorch
ml-jku/LRAM
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Tobi-Tob/CityLearnTransformer
This repository is used to generate data and evaluate Decision Transformers on the CityLearn (Challenge 2022) environment for urban energy management
ml-jku/RA-DT
Retrieval-Augmented Decision Transformer: External Memory for In-context RL
kwk2696/sb3-jax-haiku
stable-baselines with JAX & Haiku
hukz18/DeFog
Code release for the ICLR 2023 conference paper "DeFog: Decision Transformer under Random Frame Dropping"
vibalcam/deep-rl-supertux-race
Deep Reinforcement Learning AI to play the SuperTuxKart race game using a Decision Transformer
chandar-lab/SubGoal_Distillation_LLM
Code for paper Sub-goal Distillation: A Method to Improve Small Language Agents, accepted at CoLLAs 2024.
tedtedtedtedtedted/Solve-Rubiks-Cube-Via-Transformer
Applying regular transformer and decision transformer on solving the Rubik's cube. A paper is also written to document the results
ethanthoma/decision-transformer
Implementation of the decision tranformer paper in tinygrad
bhaveshgawri/decision-transformer-transfer-learning
some experiments with training and fine-tuning decision transformer
JVP15/Pilaf
Pilaf is a Backgammon agent using Decision Transformers and Offline RL
rikulehtonen/ATAG
ATAG - Automated Test Automation Generation