Missing files on MIND dataset
Could you share some files that are missing for the MIND dataset?
python mind_preprocess.py --content_info
It seems ../data/mind/articles_embeddings_1.pkl is missing. I assume it is generated by get_content_vec.py.
But how can I load the pre-trained model?
Besides, when I run
python data_process/generate_neighbor.py
it seems publish_time_1.txt is missing as well.
I would really appreciate it if you could provide these two files.
Thank you!
articles_embeddings_1.pkl: articles_embeddings_1.zip
publish_time_1.txt can be generated by mind_preprocess.py.
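For anyone unpacking the attachment, here is a minimal sketch of extracting the pickle from the zip and loading it. It assumes the archive contains a single .pkl member; the helper name load_pickle_from_zip and the target directory are illustrative and not part of the repository. The structure of the loaded object is not described in this thread, so inspect it after loading.

```python
import pickle
import zipfile
from pathlib import Path


def load_pickle_from_zip(zip_path, out_dir):
    """Extract the first .pkl member of zip_path into out_dir and unpickle it.

    Hypothetical helper: assumes the archive holds at least one .pkl file.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as zf:
        members = [name for name in zf.namelist() if name.endswith(".pkl")]
        if not members:
            raise FileNotFoundError(f"no .pkl file inside {zip_path}")
        # extract() returns the path of the file it wrote.
        extracted = zf.extract(members[0], path=out_dir)
    # Only unpickle files from a source you trust: pickle can run arbitrary code.
    with open(extracted, "rb") as f:
        return pickle.load(f)


# Example (paths are assumptions based on the messages above):
# embeddings = load_pickle_from_zip("articles_embeddings_1.zip", "../data/mind")
# print(type(embeddings))
```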
Thanks!
Have a good day ;)
Sorry to bother you again.
It seems MIND/articles_category.pkl is also missing.
It should be simple to generate, but I don't know the structure of the data.
I would appreciate it if you could provide this file.
Thanks!
Also, could you share your environment, such as the Python and TensorFlow versions?
articles_category is a dictionary mapping each news_id to its news_category. Pseudo code:
    articles_category = {}
    for each news_id, news_category:
        articles_category[news_id] = news_category
    pickle.dump(articles_category, open('../articles_category.pkl', 'wb'))
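A runnable sketch of the pseudo code above, assuming the categories are read from the MIND news.tsv file (tab-separated, no header, news ID in the first column and category in the second). The function names and paths are illustrative. Whether the repository expects raw news IDs and category strings or remapped integer indices is not stated here, so treat the key and value types as an assumption.

```python
import csv
import pickle


def build_articles_category(news_tsv_path):
    """Map each news ID to its category, read from a MIND news.tsv file."""
    articles_category = {}
    with open(news_tsv_path, encoding="utf-8", newline="") as f:
        # QUOTE_NONE keeps quote characters in titles from being parsed as CSV quoting.
        reader = csv.reader(f, delimiter="\t", quoting=csv.QUOTE_NONE)
        for row in reader:
            if len(row) < 2:
                continue  # skip blank or malformed lines
            news_id, news_category = row[0], row[1]
            articles_category[news_id] = news_category
    return articles_category


def save_pickle(obj, path):
    """Write obj to path with pickle, closing the file afterwards."""
    with open(path, "wb") as f:
        pickle.dump(obj, f)


# Example (paths are assumptions, adjust to your layout):
# categories = build_articles_category("../data/mind/news.tsv")
# save_pickle(categories, "../articles_category.pkl")
```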
Environment info:
Python: 3.6.9, TensorFlow: 1.14.0
Thank you so much!
It's so nice of you to help me with so many problems.
Enjoy your weekend!
:)