Alibaba-NLP/HiAGM

How to get the vocab files of wos dataset?

Opened this issue · 1 comments

It seems that preprocess_wos.py doesn't generate the files, word.dict and label.dict. I generate them myself by collecting every word appearing in the training set, but the test result is significantly worse than the result published. Could you tell me how you generate the vocab files?

看来 preprocess_wos.py 不会生成文件 word.dict 和 label.dict。我通过收集训练集中出现的每个单词来自己生成它们,但测试结果明显比发布的结果差。你能告诉我你是如何生成词汇文件的吗?

同样的问题,请问您解决了吗?