Can't load GPT3_XL
MK096 opened this issue · 7 comments
Hi All,
I downloaded the model from
https://the-eye.eu/public/AI/gptneo-release/GPT3_XL/
after which i changed model_path in config.json to:
"model_path" : "C:\Users\GPT_NEO_2\GPT3_XL"
Whenever i run the following code:
model = GPTNeoForCausalLM.from_pretrained("C:\Users\GPT_NEO_2\GPT3_XL")
i get an error:
f"Error no file named {[WEIGHTS_NAME, TF2_WEIGHTS_NAME, TF_WEIGHTS_NAME + '.index', FLAX_WEIGHTS_NAME]} found in "
OSError: Error no file named ['pytorch_model.bin', 'tf_model.h5', 'model.ckpt.index', 'flax_model.msgpack'] found in directory C:\Users\GPT_NEO_2\GPT3_XL or from_tf
and from_flax
set to False.
and while running :
generator = pipeline('text-generation', model="C:\Users\GPT_NEO_2\GPT3_XL")
i get following error:
f"Unrecognized model in {pretrained_model_name_or_path}. "
I have the latest TF and torch (both cpu).
Thanks
Please double check that you have downloaded all the files from the eye. There is a .index
file on the eye.
Please double check that you have downloaded all the files from the eye. There is a
.index
file on the eye.
Yes, that is there along with config. Json and several checkpoints..
Also, how do I make use of the checkpoints?
Please double check that you have downloaded all the files from the eye. There is a
.index
file on the eye.There is no .index files. I have attached the files that i got
In that case it seems like you failed to download everything. If you visit this link you'll see that there is a .index
file there.
Please double check that you have downloaded all the files from the eye. There is a
.index
file on the eye.There is no .index files. I have attached the files that i got
In that case it seems like you failed to download everything. If you visit this link you'll see that there is a
.index
file there.
I re downloaded everything. (along with index file)
While running the model i get following error now:
Argument not a list with same length as devices
Great! You are now officially out of our hands... this is a GPU configuration issue on your end. I recommend checking out multiGPU training documentation for whatever framework you are using. This issue may also be helpful.
Did anyone ever solve this issue? It doesn't make sense for it to be a GPU configuration issue if we have a standard setup with only 1 commonly used NVIDIA GPU. I'm using NVIDIA 2060S on Windows 10. Why is there a list of 255 numbers?
ValueError: Argument not a list with same length as devices arg=[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255] devices=['device:GPU:0']