EleutherAI/gpt-neo

Can't load GPT3_XL

MK096 opened this issue · 7 comments

MK096 commented

Hi All,
I downloaded the model from
https://the-eye.eu/public/AI/gptneo-release/GPT3_XL/

I then changed model_path in config.json to:
"model_path": "C:\\Users\\GPT_NEO_2\\GPT3_XL"

Whenever I run the following code:
from transformers import GPTNeoForCausalLM
model = GPTNeoForCausalLM.from_pretrained(r"C:\Users\GPT_NEO_2\GPT3_XL")

I get an error:
f"Error no file named {[WEIGHTS_NAME, TF2_WEIGHTS_NAME, TF_WEIGHTS_NAME + '.index', FLAX_WEIGHTS_NAME]} found in "
OSError: Error no file named ['pytorch_model.bin', 'tf_model.h5', 'model.ckpt.index', 'flax_model.msgpack'] found in directory C:\Users\GPT_NEO_2\GPT3_XL or from_tf and from_flax set to False.

And while running:
from transformers import pipeline
generator = pipeline('text-generation', model=r"C:\Users\GPT_NEO_2\GPT3_XL")

I get the following error:
f"Unrecognized model in {pretrained_model_name_or_path}. "

I have the latest TF and torch (both cpu).

Thanks

I have also attached my config.json.

Please double check that you have downloaded all the files from the eye. There is a .index file on the eye.
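
A quick way to run that check locally is to list the download folder and compare it with the file names in the error message. This is a minimal sketch, not part of transformers: `find_weight_files` is a hypothetical helper, and it assumes the TensorFlow checkpoint index can be recognized by a file name ending in `.index`.

```python
from pathlib import Path

# File names copied from the OSError message in the original post.
EXPECTED_WEIGHT_FILES = (
    "pytorch_model.bin",
    "tf_model.h5",
    "model.ckpt.index",
    "flax_model.msgpack",
)


def find_weight_files(model_dir):
    """Return (names matching the error message, any *.index files) in model_dir.

    Hypothetical helper for illustration only.
    """
    directory = Path(model_dir)
    names = sorted(p.name for p in directory.iterdir() if p.is_file())
    exact = [n for n in names if n in EXPECTED_WEIGHT_FILES]
    index_files = [n for n in names if n.endswith(".index")]
    return exact, index_files


if __name__ == "__main__":
    import sys

    # Pass the download folder as the first argument; defaults to the current directory.
    folder = sys.argv[1] if len(sys.argv) > 1 else "."
    exact, index_files = find_weight_files(folder)
    print("files matching the names in the error:", exact)
    print(".index files present:", index_files)
```

If both lists come back empty, the download is probably incomplete.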

MK096 commented

Yes, that is there, along with config.json and several checkpoints.
Also, how do I make use of the checkpoints?

MK096 commented

There are no .index files. I have attached the files that I got.

Also, how do I make use of the weights and checkpoints?

In that case it seems like you failed to download everything. If you visit this link you'll see that there is a .index file there.

MK096 commented

I re-downloaded everything (along with the index file).
While running the model I now get the following error:

Argument not a list with same length as devices

#227 (comment)

Great! You are now officially out of our hands... this is a GPU configuration issue on your end. I recommend checking out the multi-GPU training documentation for whatever framework you are using. This issue may also be helpful.

Did anyone ever solve this issue? It doesn't make sense for it to be a GPU configuration issue if we have a standard setup with only one commonly used NVIDIA GPU. I'm using an NVIDIA 2060S on Windows 10. Why is there a list of 256 numbers (0 through 255)?

ValueError: Argument not a list with same length as devices arg=[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255] devices=['device:GPU:0']
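
The message itself shows the mismatch: `arg` has 256 entries (0 through 255) while `devices` has one. Below is a minimal sketch of that kind of length check, reconstructed from the error text only. `check_per_device_arg` is a hypothetical name, not the framework's actual function.

```python
def check_per_device_arg(arg, devices):
    """Raise if a per-device argument list does not have one entry per device.

    Hypothetical illustration of the check implied by the error message.
    """
    if not isinstance(arg, list) or len(arg) != len(devices):
        raise ValueError(
            "Argument not a list with same length as devices "
            f"arg={arg} devices={devices}"
        )
    return arg


devices = ["device:GPU:0"]   # the single GPU shown in the error
arg = list(range(256))       # 0..255, as printed in the error

print(len(arg), len(devices))  # 256 1

try:
    check_per_device_arg(arg, devices)
except ValueError as err:
    print("mismatch:", type(err).__name__)
```

So something is asking for work to be spread over 256 device slots while only one GPU is visible. Which setting produces the 256 is not stated in this thread. The model's config.json is a reasonable place to look, but that is an assumption.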