/demo-codellama-7b-instruct-gptq

This is a CodeLlama-7B-Instruct-GPTQ model starter template from Banana.dev that allows on-demand serverless GPU inference.

Primary LanguagePython

Stargazers