finetuning-rl
There are 2 repositories under finetuning-rl topic.
promptslab/LLMtuner
FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)
ZishunYu/Actor-Critic-Alignment
Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''