Issues
- 1
I have the windows problem
#51 opened by albertimff - 1
issue
#50 opened by grattitude - 4
Problems with installation instructions
#4 opened by jonpincus - 0
I ran the test program, but why does my “example_samples.jsonl_results.jsonl” file show “all passed”?
#49 opened by CHfaithwy - 4
Evaluation doesn't work on Windows
#45 opened by peter-ch - 0
- 0
About how to use
#47 opened by RIKI0913 - 1
- 0
Tests in task 67 are impossible to satisfy
#44 opened by jack-jjm - 0
Task 145 makes no sense
#43 opened by jack-jjm - 9
evaluate_functional_correctness can't run
#18 opened by BoyuanJackChen - 6
AttributeError: Can't pickle local object 'check_correctness.<locals>.unsafe_execute'
#27 opened by tianzhaotju - 0
Removed deprecated fields.
#42 opened by Youniqueli - 0
Error in tests for HumanEval/163
#41 opened by mono-jiarui - 0
Evaluations timing out
#40 opened by antonkarlsson1 - 0
When running the code generated by the model, an error occurs: failed: No module named 'scipy'
#39 opened by HangXue-lab - 1
HE vAL
#37 opened by Youniqueli - 3
Why is pass@k = 1.0 when using "evaluate_functional_correctness data/example_samples.jsonl --problem_file=data/example_problem.jsonl"?
#16 opened by Smithol - 0
bug in estimate_pass_at_k
#35 opened by sidaw - 1
Entry point error while installing the package.
#12 opened by Ali1858 - 0
Why does the phi model output the same result for all samples when I use it at a temperature of 0.8?
#34 opened by Mrzhang-dada - 0
Where to find the leaderboard?
#33 opened by zhimin-z - 1
I do not understand how to run human eval
#32 opened by teknium1 - 1
- 3
Evaluation.py failing on KeyError: 'test/0'
#10 opened by briviere - 1
Error in the prompt of HumanEval/47
#6 opened by Kim-mins - 1
file missing
#24 opened by shuaiwang2022 - 1
Codex Training Data
#19 opened by zachares - 1
# Make adjustments to the architecture
#26 opened by D0yi - 0
Error in canonical solution 95 check_dict_case
#22 opened by PootieT - 1
- 2
- 0
Finetuning With HumanEval
#17 opened by MT010104 - 0
Why not allow contribution?
#15 opened by rafidka - 0
pass@k on filtered samples
#13 opened by henryhungle - 2
Prompt used in APPS
#11 opened by henryhungle - 1
Question about the generate_one_completion
#5 opened by xupeng1910 - 1
execution.py bug request
#3 opened by rainmaker712