A toy implementation of SPIN (Self-Play Fine-Tuning)
Primary Language: Python