How to Load a Trained Model from pytorch_model.bin for Text Generation?
lieh1203 commented
Hello,
I recently trained a model using GPT-NeoX and consolidated the global_step1000 checkpoint into a single pytorch_model.bin file using the zero_to_fp32.py script. However, I can't figure out how to load this file back into a model so that I can use it for text generation.
I have reviewed the GPT-NeoX documentation and code, but I still don't understand the specific steps required. Any example code or detailed instructions would be greatly appreciated.
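To make the question concrete, below is the generic PyTorch pattern I would expect to apply: a minimal sketch, not GPT-NeoX's actual loading path. TinyLM and load_consolidated are hypothetical names used only for illustration; TinyLM is a stand-in and is not the GPT-NeoX architecture. The sketch assumes pytorch_model.bin holds a plain state dict, and it writes its own small file so that it runs on its own. What I can't work out is which model class to construct for a GPT-NeoX checkpoint and whether the key names in the file will match.

```python
import os
import tempfile

import torch
import torch.nn as nn


class TinyLM(nn.Module):
    """Hypothetical stand-in model; NOT the GPT-NeoX architecture."""

    def __init__(self, vocab_size=16, hidden=8):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.head = nn.Linear(hidden, vocab_size)

    def forward(self, ids):
        return self.head(self.embed(ids))


def load_consolidated(model, path):
    """Load a consolidated state dict into `model` and report key mismatches."""
    # Assumption: the file contains a plain state dict (parameter name -> tensor).
    state = torch.load(path, map_location="cpu")
    # strict=False reports mismatched keys instead of raising an error.
    result = model.load_state_dict(state, strict=False)
    model.eval()  # disable dropout and similar layers for generation
    return result.missing_keys, result.unexpected_keys


with tempfile.TemporaryDirectory() as tmp:
    # Create a small stand-in file; in practice this would be the real pytorch_model.bin.
    path = os.path.join(tmp, "pytorch_model.bin")
    torch.save(TinyLM().state_dict(), path)

    model = TinyLM()
    missing, unexpected = load_consolidated(model, path)
    print("missing keys:", missing)
    print("unexpected keys:", unexpected)

    # One greedy decoding step: take the most likely next token id.
    with torch.no_grad():
        logits = model(torch.tensor([[1, 2, 3]]))
    next_token = int(logits[0, -1].argmax())
    print("next token id:", next_token)
```

If the missing or unexpected key lists are non-empty for the real checkpoint, I assume the parameter names in the file do not line up with the model I constructed, and that is the part I need help with.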
Thank you very much!