RobinQrtz

RobinQrtz's Stars

google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
27.5k2.3k
JoeyOhman/SuperLim-2-Testing
Some initial testing of SuperLim 2 tasks
Language:Python42
facebookresearch/KILT
Library for Knowledge Intensive Language Tasks
Language:Python91991
microsoft/infinibatch
Efficient, check-pointed data loading for deep learning with massive data sets.
Language:Python20517
microsoft/torchscale
Foundation Architecture for (M)LLMs
Language:Python3k210
TurkuNLP/finngen-tools
Tools for training causal language models for Finnish
Language:Python271
bminixhofer/nlprule
A fast, low-resource Natural Language Processing and Text Correction library written in Rust.
Language:Rust60539
CPJKU/wechsel
Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
Language:Python7610
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.7k6.4k
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python8.8k632
huggingface/olm-datasets
Pipeline for pulling and processing online language model pretraining data from the web
Language:Python17423
microsoft/adaptive-testing
Find and fix bugs in natural language machine learning models using adaptive testing.
Language:Jupyter Notebook18230
mosaicml/composer
Supercharge Your Model Training
Language:Python5.2k423
attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
Language:Python3.8k966
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Language:Python7k1k
NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Language:Python8.5k1.4k
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
Language:C++5.9k895
NVIDIA/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Language:C++5.2k622
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Language:Jupyter Notebook13.7k3.2k
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language:Python10.8k2.4k
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python12.4k2.6k
mediacloud/sentence-splitter
Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
Language:Python23229
hplt-project/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Language:Python48959
facebookresearch/GENRE
Autoregressive Entity Retrieval
Language:Python771103
FedML-AI/FedNLP
FedNLP: An Industry and Research Integrated Platform for Federated Learning in Natural Language Processing, Backed by FedML, Inc. The Previous Research Version is Accepted to NAACL 2022
22446
jerbarnes/sentiment_graphs
Graph parsing approach to structured sentiment analysis.
Language:Python406
lucidrains/performer-pytorch
An implementation of Performer, a linear attention-based transformer, in Pytorch
Language:Python1.1k144
idiap/fast-transformers
Pytorch library for fast transformer implementations
Language:Python1.7k179
timoschick/pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
Language:Python1.6k282
neelguha/simple-wikidata-db
A set of Python scripts for preprocessing the Wikidata JSON dump and running simple queries in an efficient manner.
Language:Python10819