Pinned Repositories
GraphProgramBench
A benchmark for evaluating language model's ability to navigate within HybridAGI programs.
GraphProgramBench
A benchmark for evaluating language model's ability to navigate within HybridAGI programs.
PartieIA
abtblank's Repositories
abtblank/GraphProgramBench
A benchmark for evaluating language model's ability to navigate within HybridAGI programs.