Set max_size to 128 but use 512 tokens
adivoj opened this issue · 2 comments
adivoj commented
Hi, great work I must say!
I understand that books can be fed into trainer while having the trainer token max size set to max but is it possible to set it to a lower number? My input is ca. 400 tokens but I'd like to speed up training by shortening it.
Thanks!
urialon commented
Hi @adivoj ,
Thank you for your interest in our work and for your kind words!
I'm not sure I understand your question.
- When we train on books, we train on the first 16,000 tokens, but test on the entire 500,000+ tokens at test time.
- If your inputs are 400 tokens long, they can fit in the standard context window of all LMs. Do you need Unlimiformer for them?
Best,
Uri
adivoj commented
Hi,
Thanks @urialon, thing is that I didn't quite understand the possibilities. Now It's more clear. I though that you found a way to pack them to a smaller size which would give faster training time. No worries, unlimiformer will certainly come to a good use at some other time.
Best regards,
adivoj