WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting. Under review
Primary LanguagePython
No one’s watching this repository yet.