RobinQrtz's Stars
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
JoeyOhman/SuperLim-2-Testing
Some initial testing of SuperLim 2 tasks
facebookresearch/KILT
Library for Knowledge Intensive Language Tasks
microsoft/infinibatch
Efficient, check-pointed data loading for deep learning with massive data sets.
microsoft/torchscale
Foundation Architecture for (M)LLMs
TurkuNLP/finngen-tools
Tools for training causal language models for Finnish
bminixhofer/nlprule
A fast, low-resource Natural Language Processing and Text Correction library written in Rust.
CPJKU/wechsel
Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
huggingface/olm-datasets
Pipeline for pulling and processing online language model pretraining data from the web
microsoft/adaptive-testing
Find and fix bugs in natural language machine learning models using adaptive testing.
mosaicml/composer
Supercharge Your Model Training
attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
NVIDIA/FasterTransformer
Transformer-related optimization, including BERT, GPT
NVIDIA/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
mediacloud/sentence-splitter
Text to sentence splitter using a heuristic algorithm by Philipp Koehn and Josh Schroeder.
hplt-project/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
facebookresearch/GENRE
Autoregressive Entity Retrieval
FedML-AI/FedNLP
FedNLP: An Industry and Research Integrated Platform for Federated Learning in Natural Language Processing, Backed by FedML, Inc. The Previous Research Version is Accepted to NAACL 2022
jerbarnes/sentiment_graphs
Graph parsing approach to structured sentiment analysis.
lucidrains/performer-pytorch
An implementation of Performer, a linear attention-based transformer, in PyTorch
idiap/fast-transformers
PyTorch library for fast transformer implementations
timoschick/pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
neelguha/simple-wikidata-db
A set of Python scripts for preprocessing the Wikidata JSON dump and running simple queries in an efficient manner.