svilupp/Julia-LLM-Leaderboard

Provides a platform for the Julia community to compare AI models' abilities in generating syntactically correct Julia code, featuring structured tests and automated evaluations for easy and collaborative benchmarking.

HTML · MIT

Issues

ERROR: LoadError: UndefVarError: `run_code_blocks` not defined
#15 opened 8 months ago by ceferisbarov
1
Unregistered package in Project.toml
#10 opened 8 months ago by ceferisbarov
0
[FR] Add more test cases
#5 opened 9 months ago by svilupp
1
[FR] Add benchmark for other applications
#6 opened 9 months ago by svilupp
0
Details about the Yi Chat model?
#1 opened 10 months ago by findmyway
3