/bigcodebench-annotation

A Rigorous Benchmark for Code Generation with Realistic Constraints in the Wild

Primary LanguagePythonApache License 2.0Apache-2.0

Benchmarking Programming Agents for Realistic Function Calling

WIP. Coming soon...