summmeer/session-based-news-recommendation

Missing files on MIND dataset

Opened this issue · 6 comments

Could you share some missing files on MIND dataset?
python mind_preprocess.py --content_info
It seems ../data/mind/articles_embeddings_1.pkl is missing, which I thought maybe is generated from get_content_vec.py.
But how could I load the pre-trained model?

Besides,
python data_process/generate_neighbor.py
It seems publish_time_1.txt is missing?

I will really appreciate it if you could provide above two files.
Thank you!

articles_embeddings_1.pkl: articles_embeddings_1.zip

publish_time_1.txt can be generated in mind_preprocess.py.

Thanks!
Have a good day ;)

Sorry to bother again.
It seems MIND/articles_category.pkl is also missing.
Although it's simple to generate but I don't know the form of this data structure.
I will appreciate it if you could provide this file.
Thanks!

And could you share your environment?
like python version and tf version?

articles_category is a dictionary, pseudo code:

articles_category = {}
for each news_id, news_category:
    articles_category[news_id] = news_category
pickle.dump(articles_category, open('../articles_category.pkl', 'wb'))

Environment info:
Python: 3.6.9, TensorFlow: 1.14.0

Thank you so much!
It's so nice of you to help me with so many problems.
Enjoy your weekend!
:)