gentaiscool
Researcher @ Bloomberg. Natural Language Processing, Speech, Multilingual, Code-switching, Dialogue
Bloomberg LPNew York
Pinned Repositories
NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
code-switching-papers
A curated list of research papers and resources on code-switching
end2end-asr-pytorch
End-to-End Automatic Speech Recognition on PyTorch
few-shot-lm
The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)
indonesian-nlp
A curated list of research papers and resources on Indonesian languages
lstm-attention
Attention-based bidirectional LSTM for Classification Task (ICASSP)
ros-vrep-slam
ROS and V-REP for Robot Mapping and Localization
indonlu
The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)
nusa-crowd
A collaborative project to collect datasets in Indonesian languages.
nusax
High-quality parallel resource on sentiment analysis for 10 low-resource Indonesian languages, English, and Indonesian (Outstanding Paper at EACL 2023)
gentaiscool's Repositories
gentaiscool/end2end-asr-pytorch
End-to-End Automatic Speech Recognition on PyTorch
gentaiscool/code-switching-papers
A curated list of research papers and resources on code-switching
gentaiscool/lstm-attention
Attention-based bidirectional LSTM for Classification Task (ICASSP)
gentaiscool/few-shot-lm
The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)
gentaiscool/indonesian-nlp
A curated list of research papers and resources on Indonesian languages
gentaiscool/meta-emb
Multilingual Meta-Embeddings for Named Entity Recognition (RepL4NLP & EMNLP 2019)
gentaiscool/gentaiscool.github.io
My website
gentaiscool/matrix_fact
Matrix Factorization Library
gentaiscool/xnli-dataset
gentaiscool/acl-anthology
Data and software for building the ACL Anthology.
gentaiscool/aclpub2
gentaiscool/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
gentaiscool/BIG-bench
Beyond the Imitation Game collaborative benchmark for enormous language models
gentaiscool/calcs2023
gentaiscool/calcs2023_ingest
gentaiscool/calcs2023_test
gentaiscool/DataLab
The unified platform for data-related resources.
gentaiscool/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
gentaiscool/do-we-need-attention
gentaiscool/GitHubGraduation-2021
Join the GitHub Graduation Yearbook and "walk the stage" on June 5.
gentaiscool/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
gentaiscool/mesh-transformer-jax
Model parallel transformers in JAX and Haiku
gentaiscool/NER-datasets
Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
gentaiscool/NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
gentaiscool/nusa-datasets
gentaiscool/PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
gentaiscool/promptsource
Toolkit for creating, sharing and using natural language prompts.
gentaiscool/seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
gentaiscool/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
gentaiscool/xtreme
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.