Pinned Repositories
TPLD
trl
Train transformer language models with reinforcement learning.
ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Confucius
Code for `Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum`
lucenzhong's Repositories
lucenzhong/TPLD
lucenzhong/trl
Train transformer language models with reinforcement learning.