mathemakitten

Research Engineer @NVIDIA

@NVIDIA, San Francisco + Toronto

mathemakitten's Stars

google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
26.6k 285 412.2k
srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
Language: Jupyter Notebook 8.8k 43 31535
elyase/awesome-gpt3
4.6k 163 19357
thunlp/PLMpapers
Must-read Papers on pre-trained language models.
3.3k 149 11437
young-geng/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
Language: Python 2.4k 42 88253
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Language: Python 1.8k 24 182342
lightswitch05/hosts
Hostfile blocklist for ads and tracking, updated regularly
1.5k 36 40075
microsoft/TextWorld
TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
Language: Jupyter Notebook 1.2k 38 82187
Separius/awesome-fast-attention
list of efficient attention modules
Language: Python 990 32 3109
bigscience-workshop/bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
Language: Shell 975 38 19101
unitaryai/detoxify
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
Language: Python 931 15 63114
huggingface/transformers-bloom-inference
Fast Inference Solutions for BLOOM
Language: Python 557 12 64112
UDST/urbansim
Platform for building statistical models of cities and regions
Language: Python 479 79 58131
akamhy/waybackpy
Wayback Machine API interface & a command-line tool
Language: Python 465 10 8334
nivbend/gitstery
A Git Murder Mystery
441 8 025
NVIDIA/NeMo-Megatron-Launcher
NeMo Megatron launcher and tools
Language: Python 389 19 29112
bigcode-project/bigcode-dataset
Language: Jupyter Notebook 359 9 3961
google/CommonLoopUtils
CLU lets you write beautiful training loops in JAX.
Language: Jupyter Notebook 320 10 031
huggingface/datablations
Scaling Data-Constrained Language Models
Language: Jupyter Notebook 313 33 719
bigscience-workshop/data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
Language: Jupyter Notebook 300 24 1240
stas00/toolbox
Essential guides and programming tools in my toolbox (with focus on ML Training)
Language: Python 300 7 212
salesforce/jaxformer
Minimal library to train LLMs on TPU in JAX with pjit().
Language: Python 271 9 2236
NVIDIA/JAX-Toolbox
JAX-Toolbox
Language: Jupyter Notebook 231 23 17844
EleutherAI/oslo
OSLO: Open Source for Large-scale Optimization
Language: Python 172 5 6929
ryderr/git-poetry
you push me, I pull.
165 15 338
huggingface/bloom-jax-inference
Language: Python 64 16 39
ramybaly/News-Media-Reliability
Language: Python 54 5 435
commoncrawl/cc-notebooks
Various Jupyter notebooks about Common Crawl data
Language: Jupyter Notebook 44 18 29
minqi/wordcraft
An environment for benchmarking commonsense agents
Language: Python 28 3 07
jeffistyping/hellasus
2 1 10
