IndoNLP/nusa-crowd
A collaborative project to collect datasets in Indonesian languages.
Jupyter NotebookApache-2.0
Issues
- 0
Create dataset loader for IndoYTASRNews
#372 opened by SamuelCahyawijaya - 0
Create dataset loader for TaPaCo
#371 opened by SamuelCahyawijaya - 0
Create dataset loader for ELI5_ID
#370 opened by SamuelCahyawijaya - 0
Create dataset loader for LexiRumah
#369 opened by SamuelCahyawijaya - 0
- 0
Create dataset loader for QASiNa
#367 opened by SamuelCahyawijaya - 0
- 0
Create dataset loader for YRVSA - Youtube Review Video Sentiment Analysis
#365 opened by SamuelCahyawijaya - 4
- 2
- 1
- 1
- 1
- 1
- 1
Create dataset loader for NusaKalimat
#346 opened by SamuelCahyawijaya - 2
Create dataset loader for NusaParagraph
#347 opened by SamuelCahyawijaya - 0
- 1
Create dataset loader for Open subtitles
#342 opened by SamuelCahyawijaya - 2
Create dataset loader for IndQNER
#329 opened by SamuelCahyawijaya - 2
The dataset kopi_cc with the config name kopi_cc_2022_05-neardup_clean_nusantara_ssp is not complete
#337 opened by cahya-wirawan - 0
Create dataset loader for IJELID (Indonesian-Javanese-English Code-Mixed Language Identification)
#345 opened by SamuelCahyawijaya - 0
Create dataset loader for Graves' Disease Chatbot Dataset in Bahasa Indonesia
#344 opened by SamuelCahyawijaya - 0
- 0
Create dataset loader for IndoSRL
#341 opened by SamuelCahyawijaya - 2
Liputan6 xtreme_train.json file incomplete
#338 opened by gregoriomario - 2
Create dataset loader for Sampiran
#330 opened by SamuelCahyawijaya - 1
Create dataset loader for VoxLingua107
#328 opened by SamuelCahyawijaya - 2
Create dataset loader for id-en-code-mixed
#303 opened by SamuelCahyawijaya - 0
Create a dataset loader for IndQNER
#327 opened by RiaGusmita - 1
Create dataset loader for Cross-lingual Outline- based Dialogue (COD)
#304 opened by SamuelCahyawijaya - 1
- 1
Update the directory name in the guide in CONTRIBUTING.md and DATALOADER.md
#311 opened by VanillaMacchiato - 0
Wrong link on "Example" in task_schemas.md
#317 opened by muhsatrio - 3
Wrong link of "Schema Template" in task_schemas.md
#314 opened by muhsatrio - 0
Add schema documentation for Image Text, Pairs Multilabel, Speech Text, Speech to Speech, Text Multilabel
#318 opened by muhsatrio - 1
- 2
- 2
- 1
- 1
- 1
- 3
- 1
- 1
- 1
- 1
Create dataset loader for NERGrit
#270 opened by SamuelCahyawijaya - 2
- 1
Create dataset loader for SU-ID TTS
#281 opened by SamuelCahyawijaya - 1
Create dataset loader for JV-ID ASR
#282 opened by SamuelCahyawijaya - 1