模型中是如何获取头尾实体，交互只是预训练模型的注意力矩阵吗？如何输出保存最好的模型呢，我跑出的结果相差很大

Question

模型中是如何获取头尾实体，交互只是预训练模型的注意力矩阵吗？如何输出保存最好的模型呢，我跑出的结果相差很大

Closed this issue 5 months ago · 6 comments

Answer 1 · 2024-01-06T13:54:10.000Z

Answer 2 · 2024-01-06T14:52:28.000Z

你好，在 https://github.com/wtangdev/UniRel/blob/1dbc2e72d0ed1a8eeb418883dd8dbb0cd797876d/run.py#L340C17-L340C32 中会保存dev集上最好结果的checkpoint。

效果不好，请确认下使用的是否的是正确的数据集，test_data_tyep = unirel_span对应的是nyt数据集（而不是nyt_star）

Answer 3 · 2024-01-07T12:25:40.000Z

非常感谢您的回复，我想知道test_data_tyep的参数有哪几个呢，分别代表什么意思呀，我看运行出来输出也不一样 ? ***@***.***  

…

------------------ 原始邮件 ------------------ 发件人: "wtangdev/UniRel" ***@***.***>; 发送时间: 2024年1月6日(星期六) 晚上10:52 ***@***.***>; ***@***.******@***.***>; 主题: Re: [wtangdev/UniRel] 模型中是如何获取头尾实体，交互只是预训练模型的注意力矩阵吗？如何输出保存最好的模型呢，我跑出的结果相差很大 (Issue #16) 你好，在 https://github.com/wtangdev/UniRel/blob/1dbc2e72d0ed1a8eeb418883dd8dbb0cd797876d/run.py#L340C17-L340C32 中会保存dev集上最好结果的checkpoint。效果不好，请确认下使用的是否的是正确的数据集，test_data_tyep = unirel_span对应的是nyt数据集（而不是nyt_star） — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: ***@***.***>

Answer 4 · 2024-03-01T12:38:17.000Z

我按照正确的方式下载webnlg数据并训练，报如下错误：
embeddings = inputs_embeds + token_type_embeddings
RuntimeError: The size of tensor a (334) must match the size of tensor b (322) at non-singleton dimension 1
这是为什么，数据集用的是readme.md所提供的。

Answer 5 · 2024-03-06T07:17:49.000Z

抱歉回复的很晚。

在 run.py line 30:
DataProcessorDict = {
"nyt_all_sa": UniRelDataProcessor,
"unirel_span": UniRelDataProcessor
}
可以看到只有nyt_all_sa/unirel_span两种，其中nyt_all_sa是适合 nyt*/webnlg*的，unirel_span是适合nyt/webnlg的。（也抱歉命名容易产生误解）

Answer 6 · 2024-03-06T07:18:34.000Z

我按照正确的方式下载webnlg数据并训练，报如下错误： embeddings = inputs_embeds + token_type_embeddings RuntimeError: The size of tensor a (334) must match the size of tensor b (322) at non-singleton dimension 1 这是为什么，数据集用的是readme.md所提供的。

可以看看是不是没有区分WebNLG和WebNLG*数据集。