Issues
- 2
[BUG] RuntimeError: einsum(): the number of subscripts in the equation (3) does not match the number of dimensions (1) for operand 1 and no ellipsis was given
#31 opened by Tempest56890 - 0
from transformers.file_utils import add_start_docstrings, add_start_docstrings_to_callable
#30 opened by Wh9511 - 0
Roberta-large
#29 opened by jinglin-liang - 0
- 1
- 0
philox_cuda_state for an unexpected CUDA generator used during capture. In regions captured by CUDA graphs, you may only use the default CUDA RNG generator on the device that's current when capture begins. If you need a non-default (user-supplied) generator, or a generator on another device, please file an issue.
#26 opened by CCzzzzzzz - 0
features为什么那么大,有办法减小吗?
#25 opened by FeiyuZhang98 - 1
Model Parameters Size?
#24 opened by jzhang38 - 5
关于两种Transformation:Biaffine和Decomposed Linear的疑惑点
#17 opened by scoutys - 2
请问下代码支持在多GPU下训练么
#23 opened by xwjim - 4
reproduce results
#7 opened by jiag19 - 2
模型结构和加载预训练模型时候不太懂
#22 opened by WHW-S - 5
The other two datasets and processing code
#4 opened by ZR5932 - 6
请问如何对更一般的句子做关系抽取
#20 opened by CQUTWangHong - 3
对论文里Transformation Module这一部分不太理解
#21 opened by Heresyrac - 2
请问dataset.py中的distance_buckets 的值是怎么确定出来的?
#19 opened by LawsonAbs - 4
- 2
请问predict_thresh的值是怎么确定的?
#10 opened by Mangoho - 4
关于远程监督数据集上的预训练模型训练情况
#18 opened by zhongy1026 - 1
- 1
Stop Iteration
#15 opened by nguyenvanhoang7398 - 1
- 4
你好,得到result.json 文件后,这个文件该如何理解?
#11 opened by eve1104 - 2
Empty Evidence List
#12 opened by snehasinghania - 2
- 2
Code doesn't run - RAM fills up too quickly
#6 opened by aakashb95 - 1
- 2
请问如何处理文本长度大于512的
#3 opened by Yesgo1220 - 1
论文中的疑惑点
#2 opened by chenhaishun - 3