Issues
Error when running step 2
#32 opened - 3
[Discussion] Workarounds for errors when fine-tuning the gpt2-ml 30G, 220k-step model
#31 opened by NLPIG - 0
Could you give an example of the input data, i.e. the input file format for pre_data.py?
#10 opened by EiraZhang - 0
How should the learning rate be chosen when fine-tuning with batch_size=1?
#28 opened by huangdacheng - 9
How large should my own dataset be to get good results?
#7 opened by fred-github - 0
What does CUDA_VISIBLE_DEVICES=0 do?
#8 opened by lrz512699597 - 3
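Regarding the CUDA_VISIBLE_DEVICES question above: this environment variable controls which physical GPUs the CUDA runtime exposes to a process; the value `0` makes only the first GPU visible, and it appears inside the process as device 0. A minimal sketch (the training command shown is illustrative, not taken from this repo's docs):

```shell
# CUDA_VISIBLE_DEVICES limits which physical GPUs a CUDA process can see.
# "0" exposes only the first GPU; an empty value hides all GPUs (CPU-only).
export CUDA_VISIBLE_DEVICES=0
echo "$CUDA_VISIBLE_DEVICES"
```

Prefixing a single command, e.g. `CUDA_VISIBLE_DEVICES=0 python train/train_wc.py`, scopes the setting to that one process, which is the common way to pin a training run to one GPU on a multi-GPU machine.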
Should the output path in train/train_wc.py be changed to the original model's location?
#20 opened by cncbec - 1
Why does "is not in all_model_checkpoint_paths" appear?
#23 opened by cncbec - 0
Is a batch size of 1 sufficient for fine-tuning?
#24 opened by huangdacheng - 3
22.json can be generated, but train.tfrecord cannot
#22 opened by cncbec - 0
Roughly how low should the loss drop before training is about done?
#21 opened by joytianya - 1
Does training with bs=1 actually work?
#16 opened by joytianya - 5
Loading the original large gpt2-ml model fills even 9 GB of GPU memory and throws OOM; how much GPU memory does the author use?
#17 opened by huangdacheng - 1
Error at the tfrecord generation step
#14 opened by dakkor - 2
Why can the pretrained model file never be found during fine-tuning?
#9 opened by lrz512699597 - 3
pre_data.py
#6 opened by lrz512699597 - 1
"iterations_per_loop", 1600,
#5 opened by lrz512699597 - 0
Can multi-machine, multi-GPU training be used? Our small shop has 4×4 1080 Ti cards
#4 opened by SeekPoint