/flask-llama2

A small Python sample showing how to serve a 3B Llama 2 model on a GPU using Flask.
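As a rough illustration of the serving pattern described above, here is a minimal sketch of a Flask endpoint that wraps a text-generation callable. It is not the repository's code. The `/generate` route, the `prompt` and `max_new_tokens` fields, and the `create_app` and `echo_generate` names are all assumptions made for this example. The stand-in `echo_generate` replaces the real model call so that the sketch runs without a GPU or model weights.

```python
from flask import Flask, jsonify, request


def create_app(generate):
    """Build a Flask app around a text-generation callable.

    `generate(prompt, max_new_tokens) -> str` is a hypothetical interface;
    in a real deployment it would wrap a model already loaded onto the GPU.
    """
    app = Flask(__name__)

    @app.route("/generate", methods=["POST"])
    def generate_route():
        # Parse the JSON body; fall back to an empty dict on bad input.
        payload = request.get_json(silent=True) or {}
        prompt = payload.get("prompt")
        if not isinstance(prompt, str) or not prompt:
            return jsonify(error="'prompt' must be a non-empty string"), 400

        # Validate the (assumed) generation-length parameter.
        try:
            max_new_tokens = int(payload.get("max_new_tokens", 64))
        except (TypeError, ValueError):
            return jsonify(error="'max_new_tokens' must be an integer"), 400
        if max_new_tokens < 1:
            return jsonify(error="'max_new_tokens' must be positive"), 400

        return jsonify(completion=generate(prompt, max_new_tokens))

    return app


def echo_generate(prompt, max_new_tokens):
    # Stand-in for the real model call: reverses the prompt and truncates it.
    # Replace with a function that runs the actual model on the GPU.
    return prompt[::-1][:max_new_tokens]


if __name__ == "__main__":
    # Load the model once at startup (not per request), then serve.
    create_app(echo_generate).run(host="127.0.0.1", port=5000)
```

Passing the generation function into `create_app` keeps the model loading separate from the HTTP layer, so the route can be exercised with Flask's test client before any weights are loaded.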

Primary Language: Python