Issues
The test_ch.yaml file
#27 opened by jiangix-paper - 0
Loss does not need to be computed on the padding portion
#56 opened by Play2Boy - 0
About token pruning for bloom
#54 opened by yangjianxin1 - 0
Issue with some characters not displaying correctly in mengzi-gpt-neo-base
#53 opened by chn-lee-yumi - 2
Couldn't find an official pre-training script; how well does continued pre-training with MLM alone work?
#50 opened by ShadowTeamCN - 2
mengzi-gpt-neo-base cannot be tried on huggingface; an exception is thrown
#51 opened by liruixue - 0
How is text generation put into production?
#49 opened by ZTurboX - 2
Input prefix of the model mengzi-t5-base
#46 opened by KarenMars - 2
Mengzi-T5-base-MT model size
#48 opened by yuange555 - 0
Will the Mengzi-BERT-large model be released?
#47 opened by hopegithub - 0
A tensorflow version of mengzi-bert-base is available for download for anyone who needs it
#30 opened by YuandZhang - 1
The commas and parentheses in the vocabulary are English punctuation
#42 opened by ruoyusong - 0
What do the floating-point numbers in the vocabulary represent?
#41 opened by Ponyo1 - 0
What is the format of the input data (tsv file) when extracting image features with the pre-trained X152-C4 model?
#40 opened by lvbu12 - 1
Could you add a couple more example scripts?
#39 opened by cabisarri - 1
Is it suitable for multi-turn dialogue tasks?
#38 opened by jiangliqin - 1
The WeChat discussion group is full
#36 opened by Fanchao-Qi - 2
Question about natural language understanding tasks
#34 opened by JaheimLee - 1
T5-small version of the model
#26 opened by JaheimLee - 1
Are there plans to open-source mengzi-large models?
#32 opened by cingtiye - 3
Can mengzi be pre-trained the same way as BERT?
#24 opened by sherlcok314159 - 1
Is the batch size actually 128 or 16384?
#21 opened by hankcs - 1
Is the pre-training task of Mengzi-T5-base DAE or LM?
#20 opened by hankcs - 3
What is the input format for the model to automatically generate marketing copy?
#16 opened by Nipi64310 - 1
How is the pre-training schedule configured?
#19 opened by NinedayWang - 1
I suggest re-testing RoBERTa with the same test script
#17 opened by bojone