mhtang1995/BOPN

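Could the author provide the datasets, or explain how to convert them to JSON format? When I try to reproduce your code, opening the JSON data always raises an error, because the datasets I downloaded myself (for example, the Chinese ones) are not originally in JSON format.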



[ {
"sentence": [
"AFP_ENG_20030428",
".",
"0720",
"NEWS",
"STORY",
"20030428",
"NKorea",
"offers",
"to",
"scrap",
"nuke",
",",
"missile",
"programs",
"but",
"wants",
"big",
"concessions",
":",
"US",
"by",
"Matthew",
"Lee",
"ATTENTION",
"-",
"UPDATES",
"///",
"WASHINGTON",
",",
"April",
"28",
"(",
"AFP",
")",
"-",
"North",
"Korea",
"has",
"offered",
"to",
"scrap",
"its",
"nuclear",
"weapons",
"and",
"missile",
"programs",
",",
"but",
"only",
"in",
"return",
"for",
"\"",
"considerable",
"\"",
"diplomatic",
",",
"political",
"and",
"economic",
"concessions",
",",
"the",
"United",
"States",
"said",
"Monday",
"."
],
"ner": [
{
"index": [
6
],
"type": "GPE"
},
{
"index": [
10
],
"type": "WEA"
},
{
"index": [
12
],
"type": "WEA"
},
{
"index": [
19
],
"type": "GPE"
},
{
"index": [
21,
22
],
"type": "PER"
},
{
"index": [
27
],
"type": "GPE"
},
{
"index": [
32
],
"type": "ORG"
},
{
"index": [
35,
36
],
"type": "GPE"
},
{
"index": [
41
],
"type": "GPE"
},
{
"index": [
41,
42,
43
],
"type": "WEA"
},
{
"index": [
45
],
"type": "WEA"
},
{
"index": [
63,
64,
65
],
"type": "GPE"
}
]
}]

It should be in this kind of format.
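For anyone converting their own data: in the sample above, each record has a `sentence` list of tokens and a `ner` list whose `index` values are 0-based token positions (for example, index 6 is "NKorea"). Below is a minimal sketch of one way to produce that layout. It assumes the raw file is in a CoNLL-style layout, with one token (or one character, for Chinese) and its BIO tag per line and a blank line between sentences. That input format is an assumption, not something confirmed in this thread, and the function names `bio_to_records` and `spans_from_bio` are hypothetical, not part of the BOPN repository.

```python
import json


def spans_from_bio(tags):
    """Turn a list of BIO tags into [{"index": [...], "type": ...}, ...]."""
    entities = []
    current = None  # entity currently being extended, if any
    for i, tag in enumerate(tags):
        if tag.startswith("B-"):
            current = {"index": [i], "type": tag[2:]}
            entities.append(current)
        elif tag.startswith("I-") and current is not None and current["type"] == tag[2:]:
            current["index"].append(i)
        else:
            # "O", or an I- tag with no matching B- before it: close the entity
            current = None
    return entities


def bio_to_records(lines):
    """Convert 'token tag' lines (assumed input format) into the JSON records."""
    records = []
    tokens, tags = [], []

    def flush():
        # Emit the sentence collected so far, then reset the buffers.
        if tokens:
            records.append({"sentence": list(tokens), "ner": spans_from_bio(tags)})
            tokens.clear()
            tags.clear()

    for line in lines:
        parts = line.split()
        if not parts:  # a blank line ends the sentence
            flush()
            continue
        tokens.append(parts[0])   # first column: token
        tags.append(parts[-1])    # last column: BIO tag
    flush()  # the last sentence may not be followed by a blank line
    return records


# Tiny made-up sample, only to show the shape of the input and output.
sample = """中 B-GPE
国 I-GPE
很 O
大 O

李 B-PER
明 I-PER
来 O
了 O
""".splitlines()

records = bio_to_records(sample)
print(json.dumps(records, ensure_ascii=False, indent=2))
```

To write a file, pass the same `records` list to `json.dump` with `ensure_ascii=False` so Chinese characters stay readable. Note that flat BIO tags cannot express nested entities such as the overlapping index 41 and 41 to 43 spans in the sample above, so data with nesting would need a different source annotation.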