gentaiscool

Researcher @ Capital One AI Foundations. Natural Language Processing, Speech, Multilingual, Code-switching, Dialogue

Capital One AI Foundations, New York

gentaiscool's Stars

meta-llama/llama
Inference code for Llama models
Language: Python | 53.7k 508 9239.3k
Nyandwi/machine_learning_complete
A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.
Language: Jupyter Notebook | 4.5k 84 4732
state-spaces/s4
Structured state space sequence models
Language: Jupyter Notebook | 2.2k 49 128266
IndoNLP/nusa-crowd
A collaborative project to collect datasets in Indonesian languages.
Language: Jupyter Notebook | 251 6 19160
LAION-AI/Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks
Language: Python | 203 13 919
shauryr/ACL-anthology-corpus
This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs
Language: Jupyter Notebook | 164 7 313
srush/do-we-need-attention
Language: TeX | 159 8 17
forestagostinelli/DeepCubeA
Code for DeepCubeA, a Deep Reinforcement Learning algorithm that can learn to solve the Rubik's cube.
Language: Python | 144 4 150
ExpressAI/DataLab
The unified platform for data-related resources.
Language: Python | 126 11 12928
nlp-uoregon/Okapi
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
Language: Python | 82 5 42
IndoNLP/nusax
High-quality parallel resource on sentiment analysis for 10 low-resource Indonesian languages, English, and Indonesian (Outstanding Paper at EACL 2023)
Language: Jupyter Notebook | 80 8 08
bloomberg/minilmv2.bb
Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)
Language: Python | 59 8 26
gentaiscool/indonesian-nlp
A curated list of research papers and resources on Indonesian languages
39 6 03
bloomberg/kbir_keybart
Experimental code used in pre-training the KBIR and KeyBART models
Language: Python | 26 6 13
IndoNLP/nusa-writes
NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented and extremely low-resource Indonesian local languages.
Language: Jupyter Notebook | 26 6 21
bltlab/mot
Multilingual Open Text
Language: Python | 24 3 44
yogisalomo/english-speaker-friendly-korean-companies
Repository to aggregate data about Korean companies that work with English as an official language or accept non-Korean-speaking members
24 5 48
bltlab/paranames
ParaNames: A multilingual resource for parallel names
Language: Python | 23 2 43
HLTCHKUST/KnowExpert
The implementation of the paper "Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters".
Language: Python | 17 6 43
Southeast-Asia-NLP/LLM-Code-Mixing
Can LLMs generate code-mixed sentences through zero-shot prompting?
10 3 00
aparnadutta/code-mixed-lid
Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.
Language: Python | 8 2 01
IndoNLP/nusa-catalogue
Dataset Catalogue Homepage for Indonesian Languages
Language: JavaScript | 6 5 56
neulab/globalbench
GlobalBench: A Benchmark for Global Progress in Language Technology
Language: Python | 6 2 0
kongaskristjan/rubik
Solve a Rubik's Cube with neural networks
Language: Python | 5 3 20
bharathichezhiyan/HopeEDI
HopeEDI: A Multilingual Hope Speech Detection Dataset for Equality, Diversity, and Inclusion
3 2 00
Genius1237/numpy-gpt2
Language: Python | 1 1 0
holylovenia/emoji-GAN
HKUST's ELEC5680/COMP5214 Advanced Deep Learning Architectures Assignment 3
Language: Python | 1 2 30
IndoNLP/.github
Landing page
1 3 00
IndoNLP/indonlp.github.io
Language: SCSS | 1 4 02
wenliangdai/Weakly-Supervised-Multitask-MAR
Weakly-supervised Multitask Multimodal Affect Recognition.
1 3 0
