SEACrowd/seacrowd-datahub
A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
PythonApache-2.0
Issues
- 1
Create dataset loader for META MMLU (Thai)
#730 opened by wannaphong - 1
Create dataset loader for IndoAbusive
#717 opened by SamuelCahyawijaya - 1
Create dataset loader for Sentiment-Annotated Taglish Product and Service Reviews (SentiTaglish: Products and Services)
#720 opened by SamuelCahyawijaya - 0
- 0
Create dataset loader for VLUE
#725 opened by SamuelCahyawijaya - 0
Create dataset loader for thai_exam
#724 opened by SamuelCahyawijaya - 0
- 0
Create dataset loader for AMADI_LontarSet Isolated Character Recognition of Balinese Script in Palm Leaf Manuscript Images
#722 opened by SamuelCahyawijaya - 0
Create dataset loader for AMADI_LontarSet Query-by-Example Word Spotting on Palm Leaf Manuscript Images
#721 opened by SamuelCahyawijaya - 0
Create dataset loader for NECID
#719 opened by SamuelCahyawijaya - 0
Create dataset loader for IndoACD
#718 opened by SamuelCahyawijaya - 0
Create dataset loader for SAInT
#716 opened by SamuelCahyawijaya - 0
Create dataset loader for Thai Handwritten Free Datasets by Wang: Data Market
#715 opened by SamuelCahyawijaya - 0
- 0
Create dataset loader for NERSkill.Id
#713 opened by SamuelCahyawijaya - 0
- 0
Create dataset loader for Parallel Corpus Dataset of Indonesian and Bengkulu Malay Language
#711 opened by SamuelCahyawijaya - 0
Create dataset loader for IndoCulture
#710 opened by SamuelCahyawijaya - 0
- 0
Create dataset loader for Lexibank
#708 opened by SamuelCahyawijaya - 0
Create dataset loader for SCH
#707 opened by SamuelCahyawijaya - 0
Create dataset loader for AlloVera
#706 opened by SamuelCahyawijaya - 0
Create dataset loader for DaMuEL
#705 opened by SamuelCahyawijaya - 0
Create dataset loader for Cross-Lingual Data Augmentation For Thai QA
#704 opened by SamuelCahyawijaya - 0
- 0
Create dataset loader for Bible Corpus
#702 opened by SamuelCahyawijaya - 0
Create dataset loader for University of Maryland Parallel Corpus Project: The Bible
#701 opened by SamuelCahyawijaya - 0
Create dataset loader for eBible
#700 opened by SamuelCahyawijaya - 0
Create dataset loader for OpenMSD
#699 opened by SamuelCahyawijaya - 0
Create dataset loader for M3IT
#698 opened by SamuelCahyawijaya - 0
Create dataset loader for Corpus Crawler
#697 opened by SamuelCahyawijaya - 0
Create dataset loader for DeepLontar
#696 opened by SamuelCahyawijaya - 0
- 0
Create dataset loader for AMADI_LontarSet Binarization of Palm Leaf Manuscript Images
#694 opened by SamuelCahyawijaya - 0
Create dataset loader for GlobalVoices
#693 opened by SamuelCahyawijaya - 0
Create dataset loader for Asian Signbank
#692 opened by SamuelCahyawijaya - 0
Create dataset loader for Grambank
#691 opened by SamuelCahyawijaya - 0
Create dataset loader for Amanatun wordlist
#690 opened by SamuelCahyawijaya - 0
Create dataset loader for MLQE-PE
#689 opened by SamuelCahyawijaya - 0
- 0
Create dataset loader for prachathai-67k
#687 opened by SamuelCahyawijaya - 0
- 0
Unicode no longer hosts the UDHR, so the seacrowd-datahub does not either.
#685 opened by kargaranamir - 0
- 0
Create dataset loader for ProSub
#683 opened by SamuelCahyawijaya - 0
Create dataset loader for Glot500-c
#682 opened by SamuelCahyawijaya - 0
Create dataset loader for IndoModal
#681 opened by SamuelCahyawijaya - 0
Create dataset loader for TrueVoice Intent
#680 opened by SamuelCahyawijaya - 0
Create dataset loader for Automated Similarity Judgment Program (ASJP)
#665 opened by SamuelCahyawijaya - 0