spin-toy: A Python repository from martyn

This is a toy implementation of SPIN from "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models"

The main part is the loss function.

## License

MIT

## References

* https://arxiv.org/pdf/2401.01335.pdf

martyn/spin-toy