/AMNN

AMNN: Attention-based Multimodal Neural Network Model for Hashtag Recommendation

Primary LanguageJupyter Notebook

AMNN

AMNN: Attention-based Multimodal Neural Network Model for Hashtag Recommendation

Dataset:

We collect almost 248,166 public microblogs according to selected 97 hashtags of "Top 100" on Instagram. The final collection contains 56861 microblogs which include both text and image, called MultiModal data from Instagram (MM-INS). We filter duplicate hashtags in one sample and drop out those microblogs without texts.

This dataset is a collection of crawled microblogs from Instagram by using Instaloader API, https://instaloader.github.io/. As the raw dataset is too larger to upload all of them, we choose 3 sub-datasets without preprocessing, including "#beach", "#cat", "#dog", and the corresponding sub-datasets with preprocessing that remove those images without texts, including "beach", "cat", "dog". In addition, more samples can be found in Google Driver. Hope the data can be helpful for your research, and we are open for academic cooperation if necessary.

Open access: