/CMNER

The Chinese MNER dataset.

Primary LanguagePython

CMNER

Here is the Chinese MNER dataset based on social media. 
Each post is composed of a text and relevant images. Every text has an unique "wid" and the images named with this "wid" at the beginning are the corresponding images for the text.

Citation

If you find this code to be useful for your research, please consider citing.

https://arxiv.org/abs/2402.13693
@misc{ji2024cmner,
      title={CMNER: A Chinese Multimodal NER Dataset based on Social Media}, 
      author={Yuanze Ji and Bobo Li and Jun Zhou and Fei Li and Chong Teng and Donghong Ji},
      year={2024},
      eprint={2402.13693},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}