Pinned Repositories
llamafile
Distribute and run LLMs with a single file. Rubra customized version that adds grammar to chat completion api.
rubra
Open Weight, tool-calling LLMs
rubra-embed-benchmark
rubra-tools
tools.cpp
LLM inference in C/C++, further modified for Rubra function calling models
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs. Extended for Rubra function calling models
rubra-ai's Repositories
rubra-ai/rubra
Open Weight, tool-calling LLMs
rubra-ai/tools.cpp
LLM inference in C/C++, further modified for Rubra function calling models
rubra-ai/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs. Extended for Rubra function calling models
rubra-ai/llamafile
Distribute and run LLMs with a single file. Rubra customized version that adds grammar to chat completion api.
rubra-ai/rubra-embed-benchmark
rubra-ai/rubra-tools