Issues
- 1
- 2
help 我单机测试两台机器都能正常,但是多机器并行后会出现环境问题
#23 opened by XiaoqingNLP - 0
- 3
should docker pull gyxthu17/cpm-2:1.2 be ran under the root dir of this reository?
#27 opened by lsy641 - 0
MOE 训练问题?
#28 opened by XiaoqingNLP - 9
环境问题
#29 opened by 520jefferson - 0
使用docker 容器运行程序,这个host_file应该如何设置IP?
#30 opened by XiaoqingNLP - 2
CPM-2-Pretrain-moe的词表问题
#17 opened by jiayuchennlp - 5
专家网络参数应该broadcast么
#14 opened by icodingc - 8
- 10
- 1
- 0
about the parameters of MOE?
#26 opened by XiaoqingNLP - 2
Model parallelism of CPM2-MoE
#15 opened by MichaelXSChen - 0
数据预处理tokenize无法处理特殊token
#25 opened by zetian1025 - 2
CPM2模型推理代码
#22 opened by Bournet - 4
- 9
关于CPM2模型生成的问题
#16 opened by jiayuchennlp - 3
Dev/Test split for Math23K dataset
#3 opened by sjy1203 - 3
Link for 《CUGE: A chinese language understanding and generation evaluation benchmark》
#2 opened by sjy1203 - 0
如何加载CPM2.1 或者cmp2.0 模型进行微调?
#21 opened by XiaoqingNLP - 1
- 1
文本生成强化版模型 CPM2.1的链接是哪个?
#18 opened by chenjunqiang - 1
CPM-2-Pretrain是否没有transformer版本?
#19 opened by NemoCoder - 1
CPM-2-Pretrain数据处理与读取问题
#10 opened by RoyZhanyi - 5
Deepspeed Zero3?
#12 opened by k15201363625 - 2
Model checkpoint convert
#13 opened by k15201363625 - 1
The import path is not correct.
#8 opened by xia-xiao - 1
您好,CPM-2技术报告的链接404了
#6 opened by wakafengfan - 0
NA
#5 opened by ShenDezhou - 3
请问后续会放出sample供使用参考吗?
#4 opened by NLPIG - 1