karpathy/build-nanogpt

How to support padding in the train dataset for training ?

mrhimanshu opened this issue · 2 comments

How to support padding in the train dataset for training ?

I would just add it in the fineweb.py script when you are tokenizing the rows.

@mrhimanshu sorry forgot to tag you