Issues
- 5
Task HumanEval/092 has contradictory tests in Rust
#142 opened by geajack - 1
leetcode dataset not found
#134 opened by way2swaggy - 1
Citation for the LeetCode Dataset
#140 opened by JJGO - 2
- 2
Quantized model is not supported - Calling cuda() is not supported for 4-bit or 8-bit quantized models
#137 opened by Santhoshkumar-p - 0
- 2
code generated with wrong end of string place
#128 opened by tedvuminhhuy - 2
padding left some token causing compile error
#127 opened by tedvuminhhuy - 1
- 4
- 0
All non-multiline commented prompts currently broken
#114 opened by cassanof - 0
R prompts are currently broken
#111 opened by arjunguha - 1
Turn translator into a library
#82 opened by arjunguha - 2
Could I get all statistics?
#91 opened by sh0416 - 1
- 3
Error evaluating TS/Java
#89 opened by memray - 4
- 15
Add HumanEval+ tests
#62 opened by Randl - 3
Scala tests comparing optional value
#64 opened by PootieT - 3
Small issues with Swift prompt signatures
#63 opened by PootieT - 1
C# test sequence equality
#71 opened by PootieT - 0
PHP test indexed array comparison
#72 opened by PootieT - 4
Perl Unit test comparing float values
#67 opened by PootieT - 5
Perl Unit test when expecting "False/0" output
#66 opened by PootieT - 13
Environment for evaluating C#
#34 opened by memray - 4
Racket unit test numerical equivalence
#60 opened by PootieT - 8
R unit test comparison between integer and double
#55 opened by PootieT - 7
R unit tests atomic vector comparison
#50 opened by PootieT - 3
C++ test float comparison
#51 opened by PootieT - 2
Java transpiled test failing with optional output
#47 opened by PootieT - 1
Stop tokens for Java do not allow completions that produce several top-level methods.
#59 opened by PootieT - 1
Reported pass@k silently wrong for n<k
#40 opened by daniel-vainsencher - 5
- 1
Support execution for the FIM benchmarks
#29 opened by arjunguha - 1
- 2
- 5
- 1
- 1
Better isolation for evaluation
#19 opened by arjunguha - 0
- 0
Bad filenames when using gz
#16 opened by arjunguha