Community-maintained benchmark repo for common BAML tasks such as tool-calling, classification, and more.
BoundaryML/tc-benchmark
Benchmarks to help find the best, fastest, and cheapest open models for structured output and tool use.