mathemakitten's Stars
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
elyase/awesome-gpt3
thunlp/PLMpapers
Must-read Papers on pre-trained language models.
young-geng/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
lightswitch05/hosts
Hostfile blocklist for ads and tracking, updated regularly
microsoft/TextWorld
TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
Separius/awesome-fast-attention
list of efficient attention modules
bigscience-workshop/bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
unitaryai/detoxify
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
huggingface/transformers-bloom-inference
Fast Inference Solutions for BLOOM
UDST/urbansim
Platform for building statistical models of cities and regions
akamhy/waybackpy
Wayback Machine API interface & a command-line tool
nivbend/gitstery
A Git Murder Mystery
NVIDIA/NeMo-Megatron-Launcher
NeMo Megatron launcher and tools
bigcode-project/bigcode-dataset
google/CommonLoopUtils
CLU lets you write beautiful training loops in JAX.
huggingface/datablations
Scaling Data-Constrained Language Models
bigscience-workshop/data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
stas00/toolbox
Essential guides and programming tools in my toolbox (with focus on ML Training)
salesforce/jaxformer
Minimal library to train LLMs on TPU in JAX with pjit().
NVIDIA/JAX-Toolbox
JAX-Toolbox
EleutherAI/oslo
OSLO: Open Source for Large-scale Optimization
ryderr/git-poetry
you push me, I pull.
huggingface/bloom-jax-inference
ramybaly/News-Media-Reliability
commoncrawl/cc-notebooks
Various Jupyter notebooks about Common Crawl data
minqi/wordcraft
An environment for benchmarking commonsense agents
jeffistyping/hellasus