openpsi-project/ReaLHF
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
PythonApache-2.0
Issues
- 2
- 8
grpo has not prm
#79 opened by yiyepiaoling0715 - 4
关于代码中数据传输、参数传输等的若干疑问
#82 opened by metaqiang - 5
How to debug remote subprocess?
#81 opened by metaqiang - 1
Suggestion for Fine-Grained Batch Control e.g `per_device_train_batch_size` or `mini_batch_size`
#80 opened by dechunwang - 7
有多机分布式任务的example吗?
#24 opened by PKUFlyingPig - 0
DeepSpeed v0.14.0 is not compatible with the latest Nvidia PyTorch docker image.
#77 opened by garrett4wade - 0
- 0
Automatic importing can be problematic if the parent directory contains "realhf".
#58 opened by garrett4wade - 5
- 2
allocation_mode
#45 opened by AIRobotZhang - 2
TransformerConfig() takes no arguments
#46 opened by AIRobotZhang - 0
Issues for the generation experiment.
#59 opened by garrett4wade - 2
- 1
No module named 'colorlog'
#41 opened by AIRobotZhang - 2
如何推理
#40 opened by AIRobotZhang - 1
worker ERROR: Worker encountered error 'id'
#43 opened by AIRobotZhang - 1
Loading parameters takes exceptionally long time.
#18 opened by nuzant - 1
- 3
你们论文中的实验有认真调过其他框架的性能么?
#10 opened by hijkzzz