/sft_mathgpt2

Supervised fine-tuning using the TRL library

Primary Language: Jupyter Notebook
