Pinned Repositories
ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
atari-dqn
Implementation Deep Q Network to play Atari Games
dialogue-offline-rl
unsloth
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
StableToolBench
A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.
LDST
EMNLP 2023
Taeyoung-Jang's Repositories
Taeyoung-Jang/atari-dqn
Implementation Deep Q Network to play Atari Games
Taeyoung-Jang/dialogue-offline-rl
Taeyoung-Jang/unsloth
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory