/llama-2-function

Cog model for llama 2 inference with jsonschema

Primary LanguagePython

Llama 2 function calling

Run on Replicate

Llama 2 with grammar-based decoding (provided by llama.cpp) for constraining the output to a json schema.