Issues
- 4
about gpt-4-0125-preview reference answer
#21 opened by duguodong7 - 11
can use Qwen1.5-7B-Chat ?
#13 opened by 18600709862 - 4
Starling-LM-7B-alpha tokenizer issues
#19 opened by duguodong7 - 2
Evaluation
#20 opened by ZLKong - 3
Out of Memory Issue with OpenLLaMA-7B in Default FuseLLM Setting on A100 (80G)
#16 opened by runtsang - 0
how should I load minipile
#15 opened by BlueCestbon - 1
使用fastchat加载对话模板是什么?
#14 opened by BoFan-tunning - 2
Purpose for the Split long text step
#12 opened by ZLKong - 0
Releasing FuseLLM for Korean
#11 opened by sigridjineth - 19
Encountering NaN grad_norm and loss values when training with DeepSpeed and OrionForCausalLM model
#9 opened by sigridjineth - 3
- 3
- 4
minipile_split issue
#8 opened by Arbor334 - 6
Regarding MiniPile dataset splitting
#5 opened by ZLKong - 1
- 3
- 2