Issues
- 3
Prediction problem
#99 opened by terminator123 - 0
- 1
This tokenizer is a bit slow at tokenization
#105 opened by minmummax - 3
training cuda error
#65 opened by Cherryjingyao - 0
nll_loss_forward_reduce_cuda_kernel_2d: block: [0,0,0], thread: [18,0,0] Assertion t >= 0 && t < n_classes failed.
#113 opened by erminga - 2
lm_loss is of type str, so the loss cannot be computed
#110 opened by ziqing0701 - 5
Model dimension problem
#100 opened by terminator123 - 0
Pretrained model problem
#111 opened by Junlong-Wang - 1
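During fine-tuning, the log shows it looking for several JSON files in CDial-GPT_LCCC-large, but the downloaded CDial-GPT_LCCC-large does not contain these files, and it eventually fails with the error below. What went wrong?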
#108 opened by haiqizhang - 2
The size of tensor a (571) must match the size of tensor b (512) at non-singleton dimension 3.
#88 opened by jiangliqin - 0
On my own data, should all words be separated by ' ' (a space)?
#109 opened by Tylerjoe - 1
Question about the responses generated by the pretrained model
#107 opened by svjack - 12
RuntimeError: CUDA out of memory.
#101 opened by Deerzh - 0
Is the raw Weibo data (Weibo Corpus) available anywhere? Can it be shared?
#106 opened by RXJ588 - 4
nll_loss_forward_reduce_cuda_kernel_2d: block: [0,0,0], thread: [18,0,0] Assertion t >= 0 && t < n_classes failed.
#81 opened by Alexia1994 - 0
IndexError: Target -1 is out of bounds.
#104 opened by guantao18 - 3
seeking STC dataset
#98 opened by Zhou-Zoey - 2
RuntimeError: Address already in use
#103 opened by AlexKai1 - 3
Is incremental decoding supported?
#93 opened by songmzhang - 3
_pickle.UnpicklingError: invalid load key, 'v'.
#94 opened by cuiding - 1
Where can the GPT_{Novel} model be found?
#96 opened by mianzhang - 0
After enabling half precision, training loss = nan
#102 opened by WuDiDaBinGe - 2
RuntimeError: CUDA error: CUBLAS_STATUS_ALLOC_FAILED when calling `cublasCreate(handle)`
#70 opened by RyanYip-Kat - 4
My training resources and time are limited; could you provide the fully trained model results?
#69 opened by jiangliqin - 2
Learning rate question? The maximum learning rate is 6.25e-5
#97 opened by WuDiDaBinGe - 0
Evaluation metric PPL
#95 opened by xiao-ming-code - 3
- 1
train
#73 opened by xiao-ming-code - 1
- 1
from pretrain error
#64 opened by iris-qq - 2
TypeError: string indices must be integers
#72 opened by Jyukai02 - 1
Hello, could you provide the annotated data used in data cleaning?
#71 opened by wakafengfan - 1
Why does interact end up looping endlessly toward the end? How can this be resolved?
#90 opened by jiangjyjy - 3
LCCC new link
#92 opened by stephenroller - 1
infer
#76 opened by xiao-ming-code - 1
Data during training
#77 opened by xiao-ming-code - 1
interact.py error
#84 opened by Amazing-J - 5
Question about the format of the novel data used in pretraining
#80 opened by realLINorth - 2
When fine-tuning on STC data, which metric is used to save the best model?
#83 opened by GongMingGithub - 1
Cannot download from the cloud drive
#91 opened by qiangqiang-he - 1
How is Chinese word segmentation handled when computing embedding average?
#87 opened by allyouneeds - 3
Training directly on Chinese text
#89 opened by nuochenpku - 0
evaluation
#85 opened by zhao1402072392 - 2
- 1
LCCC-base-split cannot be decompressed
#82 opened by bingo789 - 1
Could you release the unsegmented version of the LCCC-base dataset?
#79 opened by leelinglin - 1
Where can the Chinese novel data you used for pretraining be downloaded?
#78 opened by demeiyan - 2
Results are not as good as described
#74 opened by changleilei - 1
Character embedding vector problem
#67 opened by Ultraman-Orb - 0