salesforce/CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS 2022).
Python · BSD-3-Clause
Issues
CodeT5 input for APPS/MBPP problems
#27 opened by ysymyth - 1
How to arrange the file ‘models’???
#58 opened by Mucalinda2436 - 3
Critic Training pre-processing steps
#47 opened by xylankant - 0
When will you release the implementation details of your critic sampling procedure please?
#57 opened by JulietLJY - 0
problems in the critic model results
#50 opened by Juanting-Xu - 8
Datasets for train_actor_rl.sh
#24 opened by doviettung96 - 0
documentation request for test_one_solution.py
#33 opened by ziwenyd - 0
what is the super-parameters for RL training
#29 opened by Zyq-scut - 2
Question about the max-pooling operation.
#31 opened by rongaoli - 1
exception of run_unit_tests.sh
#19 opened by qfzhu - 9
Finetuned model checkpoints
#2 opened by boblee22 - 1
Sample Temperature
#22 opened by MrBlack0220 - 1
Question about the input of the critic model
#28 opened by Zyq-scut - 2
RL with execution-based reward
#26 opened by ysymyth - 1
Performance Results on HumanEval
#25 opened by htcml - 0
problem about run_unit_tests.sh
#23 opened by MrBlack0220 - 4
Run generate.py with the CodeT5-large trained on the ground-truth programs
#18 opened by doviettung96 - 1
Question about pre-training process
#3 opened by natedingyifeng - 1
Politically correct license description
#1 opened by xloem