microsoft/Llama-2-Onnx

Cannot access new optimized model variants despite having permissions for existing variants

AustinDoolittle opened this issue · 1 comment

I'm attempting to access the newly optimized models on the main-CUDA-CPU branch, but I'm getting a 404 error, which suggests a lack of permissions. I have already been granted access to the model weights for the unoptimized models on the main branch. Are there plans to grandfather those permissions in to the newly optimized models, or is there a new process for requesting access to them?

Hello, the plan is to grandfather every user into the new sub-modules. We have added an ETA warning to the new branches (CUDA/CPU and DML) to help ease the confusion. If you need the models urgently, each branch has directions to the open-source methods of producing the ONNX assets.