svilupp/Julia-LLM-Leaderboard
Provides a platform for the Julia community to compare AI models' abilities in generating syntactically correct Julia code, featuring structured tests and automated evaluations for easy and collaborative benchmarking.
HTMLMIT
Issues
- 1
- 0
Unregistered package in Project.toml
#10 opened by ceferisbarov - 1
[FR] Add more test cases
#5 opened by svilupp - 0
[FR] Add benchmark for other applications
#6 opened by svilupp - 3
Details about the Yi Chat model?
#1 opened by findmyway