/demo-firefly-llama2-13b-v1.2

This is a Firefly-Llama2-13B-v1.2-GPTQ model starter template from Banana.dev that allows on-demand serverless GPU inference.

Primary LanguagePython

No issues in this repository yet.