How to Load Model from pytorch_model.bin into Trained Model for Text Generation?

Question

How to Load Model from pytorch_model.bin into Trained Model for Text Generation?

Opened this issue 5 months ago · 0 comments

Hello,

I've recently trained a model using GPT-NeoX and packed the checkpoint global_step1000 into a pytorch_model.bin file using the zero_to_fp32.py script. However, I'm having trouble figuring out how to load this file into the trained model for text generation.

I have reviewed the GPT-NeoX documentation and code, but I still don't understand the specific steps required. Any example code or detailed instructions would be greatly appreciated.

Thank you very much!