IggShaman opened this issue 5 months ago · 0 comments
In the early stages of training, example generation may produce out-of-bounds token ids. tiktoken.decode panics on those, so filter them out before they reach the decoder.
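A minimal sketch of the guard, written in Python for brevity (the project's own language may differ). The name `sanitize_tokens`, its parameters, and the vocabulary size in the usage note are hypothetical and not taken from the PR; the idea is only to drop or replace any id outside `[0, vocab_size)` before decoding.

```python
from typing import Iterable, List, Optional


def sanitize_tokens(
    tokens: Iterable[int],
    vocab_size: int,
    replacement: Optional[int] = None,
) -> List[int]:
    """Return only the token ids the decoder can handle.

    Ids outside [0, vocab_size) are dropped, or swapped for `replacement`
    when one is given. Both parameters are assumptions for this sketch.
    """
    safe: List[int] = []
    for token in tokens:
        if 0 <= token < vocab_size:
            safe.append(token)
        elif replacement is not None:
            safe.append(replacement)
    return safe
```

For example, with an assumed vocabulary size of 50257, `sanitize_tokens([1, 50300, -1, 5], 50257)` keeps only `1` and `5`, and the result can then be passed to the decoder. Dropping the ids keeps the output readable; passing a `replacement` instead preserves the sequence length, which may be preferable when inspecting early samples.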
PR: #80