Issues
- 1
RRHF with Online Sampling
#57 opened by sqqiao - 0
resize embedding after add_special_tokens
#56 opened by Switchsyj - 0
Runtime error:数据类型报错
#55 opened by sqqiao - 1
- 6
算loss的时候求均值的时候是不是可以优化
#51 opened by shyoulala - 2
bug 计算sft损失的时候
#48 opened by shyoulala - 3
如果我想将模型更改为baichuan2-7b-chat,需要做哪些方面的变动?
#52 opened by IT-five - 4
loss的代码关于batch size的处理有bug。
#23 opened by echoht - 0
请问基于Vicuna测试集的比较是如何进行比较的?
#53 opened by IT-five - 1
- 0
Label Shifts
#49 opened by yafuly - 1
- 3
关于alpaca-7B和LLaMA-7B
#47 opened by NEUBuffett - 5
dummy_target的请教
#45 opened by xunfengzhangyang - 2
有关IMDB数据集的问题
#46 opened by stevie1023 - 1
dummy_target的请教
#44 opened by xunfengzhangyang - 11
加载模型的问题
#43 opened by LiangZhuuu - 3
损失函数
#42 opened by xiayouhong - 1
训练过程OOM的问题
#41 opened by Guochry - 24
can RRHF train on v100 32G?
#20 opened by akk-123 - 4
Wombat与RRHF
#40 opened by Guochry - 6
The generation config for evaluation
#39 opened by stevie1023 - 2
在单卡A100上训练出现torch.distributed.elastic.multiprocessing.api.SignalException: Process 2920830 got signal: 1
#35 opened by Zhang-Each - 3
labels != -100的作用是什么
#38 opened by LSX-Sneakerprogrammer - 11
The size of tensor a (8) must match the size of tensor b (2) at non-singleton dimension 1
#36 opened by ZJXNEFU - 8
- 4
NameError: name 'save_fsdp_model' is not defined
#33 opened by ZJXNEFU - 2
评估方法与位置有很大关系
#32 opened by xiaoyuan1996 - 5
- 7
对于重复score答案样本的处理疑问
#25 opened by yanhan19940405 - 15
wombat-7B的输出异常
#21 opened by lx86110 - 2
- 1
期待LoRA或ptuning
#31 opened by Noyce765103 - 1
How to use it. Is there some code examples?
#28 opened by Mr-IT007 - 7
- 2
一些训练细节
#27 opened by xiaoyuan1996 - 1
training with my own gpt2
#22 opened by dyyzhmm - 2
PPO implementation
#19 opened by yuzc19 - 4
Wombat-7B,Wombat-7B-gpt4 and ChatGPT Results on Comparison based on Vicuna test set, evaluation by gpt-4.
#18 opened by onlyfish79 - 12
有关训练模型细节
#17 opened by yanhan19940405 - 1
Results on Comparison based on Vicuna test set
#16 opened by LeeShiyang - 1
Why use HingeLoss instead of BPRLoss ?
#15 opened by KID-22 - 10
single_sentence_inference output is empty
#14 opened by better629 - 0
We are trying to evaluate Wombat on Vicuna test set, but we do not have GPT4 API.
#11 opened by GanjinZero - 4
This loss seems to consume a lot of memory.
#13 opened by piekey1994 - 5
Error when try to inference
#12 opened by oasis-0927 - 16
RRHF only works on llama model.
#8 opened by Taekyoon - 4
Wombat weights release?
#3 opened by generalsvr - 5