- Homepage: http://captions.stair.center/
- Repository: https://github.com/shunk031/huggingface-datasets_STAIR-Captions
- Paper (Preprint): https://arxiv.org/abs/1705.00823
- Paper (ACL'17): https://aclanthology.org/P17-2066/
- Point of Contact: info_AT_stair.center
STAIR Captions is a large-scale dataset containing 820,310 Japanese captions. This dataset can be used for caption generation, multimodal retrieval, and image generation.
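For caption generation or retrieval, a common first step is pairing each image with its captions. The sketch below shows one way to do that, assuming the annotations follow an MS-COCO-style layout with `images` and `annotations` lists and the fields `id`, `file_name`, `image_id`, and `caption`. That layout, the helper name `group_captions_by_image`, and the toy record are assumptions made for illustration and are not taken from this card.

```python
from collections import defaultdict


def group_captions_by_image(coco_style: dict) -> dict:
    """Map each image file name to the list of its captions.

    Assumes an MS-COCO-style dict with "images" and "annotations" lists.
    """
    # Build a lookup from image id to file name.
    id_to_file = {img["id"]: img["file_name"] for img in coco_style["images"]}

    grouped = defaultdict(list)
    for ann in coco_style["annotations"]:
        grouped[id_to_file[ann["image_id"]]].append(ann["caption"])
    return dict(grouped)


# Hypothetical toy record for illustration only; not taken from the dataset.
sample = {
    "images": [{"id": 1, "file_name": "example_0001.jpg"}],
    "annotations": [
        {"id": 10, "image_id": 1, "caption": "公園で犬が走っている"},
        {"id": 11, "image_id": 1, "caption": "芝生の上を走る犬"},
    ],
}

pairs = group_captions_by_image(sample)
```

Grouping by file name rather than by numeric id keeps the result directly usable for loading images from disk, provided file names are unique.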
The language data in STAIR Captions is in Japanese (BCP-47 ja-JP).
Creative Commons Attribution 4.0 License.
@inproceedings{yoshikawa2017stair,
title={STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset},
author={Yoshikawa, Yuya and Shigeto, Yutaro and Takeuchi, Akikazu},
booktitle={Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
pages={417--421},
year={2017}
}
Thanks to @yuyay for creating this dataset.