gentaiscool
Researcher @ Bloomberg. Natural Language Processing, Speech, Multilingual, Code-switching, Dialogue
Bloomberg LPNew York
Pinned Repositories
NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
code-switching-papers
A curated list of research papers and resources on code-switching
end2end-asr-pytorch
End-to-End Automatic Speech Recognition on PyTorch
few-shot-lm
The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)
indonesian-nlp
A curated list of research papers and resources on Indonesian languages
lstm-attention
Attention-based bidirectional LSTM for Classification Task (ICASSP)
ros-vrep-slam
ROS and V-REP for Robot Mapping and Localization
indonlu
The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)
nusa-crowd
A collaborative project to collect datasets in Indonesian languages.
nusax
High-quality parallel resource on sentiment analysis for 10 low-resource Indonesian languages, English, and Indonesian (Outstanding Paper at EACL 2023)
gentaiscool's Repositories
gentaiscool/ros-vrep-slam
ROS and V-REP for Robot Mapping and Localization
gentaiscool/cnn-autoencoder-tf
CNN and Contrastive Autoencoder (CAE) on EMNIST using Tensorflow
gentaiscool/multi-task-cs-lm
Code-Switching Language Modeling using Syntax-Aware Multi-Task Learning (CALCS 2018, ACL)
gentaiscool/pmf
Probabilistic Matrix Factorization on MovieLens 100K
gentaiscool/scikit-learn-examples
Exploration on Logistic Regression, MLP, and SVM using Scikit-learn
gentaiscool/scatternet
Generating scatternet features
gentaiscool/Listen-Attend-and-Spell-Pytorch
Listen Attend and Spell (LAS) implement in pytorch
gentaiscool/chicken-scheme
Permutation in Chicken Scheme
gentaiscool/cyk-parser
Cocke–Younger–Kasami Algorithm Parser
gentaiscool/deep-nlp-reading-list
Deep Learning / Machine Learning reading list - mainly related to NLP
gentaiscool/pokeranch-imba
Pokemon Game in Android
gentaiscool/tripsquare
Collaborative real-time travel app planner for HackUST 2017 transportation category
gentaiscool/agem
Official implementation of the Averaged Gradient Episodic Memory (A-GEM) in Tensorflow
gentaiscool/bert-as-service
Mapping a variable-length sentence to a fixed-length vector using BERT model
gentaiscool/cihangxie.github.io
Personal homepage:
gentaiscool/CodeSwitch-Reddit
gentaiscool/coloremoji.sty
Style package for directly including color emojis in latex documents
gentaiscool/ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
gentaiscool/datasets-CMU_Wilderness
CMU Wilderness Multilingual Speech Dataset
gentaiscool/Deep-Learning-Book-Chapter-Summaries
Attempting to make the Deep Learning Book easier to understand.
gentaiscool/deepspeech.pytorch
Speech Recognition using DeepSpeech2.
gentaiscool/e2e_asr
End-to-end speech recognition using encoder-decoder model with auxiliary tasks on lower layers
gentaiscool/gt-nlp-class
Course materials for Georgia Tech CS 4650 and 7650, "Natural Language"
gentaiscool/indic_nlp_library
Resources and tools for Indian language Natural Language Processing
gentaiscool/interspeech2019-tutorial
INTERSPEECH 2019 Tutorial Materials
gentaiscool/LASER
Language-Agnostic SEntence Representations
gentaiscool/ner-dataset-modified-dee
The Datasets to Build Indonesian Named Entity Recognizer
gentaiscool/pt.darts
PyTorch Implementation of DARTS: Differentiable Architecture Search
gentaiscool/Sentence-VAE
PyTorch Re-Implementation of "Generating Sentences from a Continuous Space" by Bowman et al 2015 https://arxiv.org/abs/1511.06349
gentaiscool/umwe
Unsupervised Multilingual Word Embeddings (EMNLP 2018)