fattorib

doing a bit of this and that

Toronto, Ontario

Pinned Repositories

lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language: Python 7.5k 39 1.2k2k
fast_sequential_scan
A fast sequential scan on GPU
Language: Cuda 0 1 00
Flax-ResNets
CIFAR10 ResNets implemented in JAX+Flax
Language: Python 11 1 01
hawk-pytorch
PyTorch implementation of Hawk from "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models" (https://arxiv.org/abs/2402.19427). Compatible with torch.compile.
Language: Python 0 2 00
LeagueMatchScraper
Code to scrape League of Legends matches using the Riot Games API.
Language: Python 7 1 00
Little-GPT
GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!
Language: Python 23 3 10
RepVGG-CIFAR10
RepVGG models specifically for CIFAR10 and CIFAR100. Based on RepVGG: Making VGG-style ConvNets Great Again (Ding et al.)
Language: Python 6 1 01
transformer_shmap
Tensor Parallelism with JAX + Shard Map
Language: Python 11 1 01
tritonformer
Trainable transformer with fwd+bwd ops in Triton, matching the performance of PyTorch + cuDNN/cuBLAS
Language: Python 4 1 00
ZeRO-transformer
Two implementations of ZeRO-1 optimizer sharding in JAX
Language: Python 13 2 60

fattorib's Repositories

fattorib/Little-GPT
GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!
Language: Python 23 3 10
fattorib/ZeRO-transformer
Two implementations of ZeRO-1 optimizer sharding in JAX
Language: Python 13 2 60
fattorib/Flax-ResNets
CIFAR10 ResNets implemented in JAX+Flax
Language: Python 11 1 01
fattorib/transformer_shmap
Tensor Parallelism with JAX + Shard Map
Language: Python 11 1 01
fattorib/LeagueMatchScraper
Code to scrape League of Legends matches using the Riot Games API.
Language: Python 7 1 00
fattorib/RepVGG-CIFAR10
RepVGG models specifically for CIFAR10 and CIFAR100. Based on RepVGG: Making VGG-style ConvNets Great Again (Ding et al.)
Language: Python 6 1 01
fattorib/tritonformer
Trainable transformer with fwd+bwd ops in Triton, matching the performance of PyTorch + cuDNN/cuBLAS
Language: Python 4 1 00
fattorib/fusedswiglu
Fused SwiGLU Triton kernels
Language: Python 3 1 10
fattorib/StochasticDepthNets
PyTorch implementation of ResNet110 as described in Deep Networks with Stochastic Depth (Huang et al.)
Language: Python 2 1 00
fattorib/wtf-wikipedia-python
Raw Wikipedia XML to LM_Dataformat in under 4 hours
Language: Python 2 1 01
fattorib/Monte-Carlo-Fractal-Dimensionality
Efficient random-sampling algorithm for calculating the dimension of many basic fractals, implemented in Python.
Language: Python 1 0 01
fattorib/picograd
picograd - Fully connected neural networks in Python
Language: Python 1 1 00
fattorib/fast_sequential_scan
A fast sequential scan on GPU
Language: Cuda 0 1 00
fattorib/GeometricDeepLearning
Introductory Geometric Deep Learning Presentation from September 2021
0 1 00
fattorib/hawk-pytorch
PyTorch implementation of Hawk from "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models" (https://arxiv.org/abs/2402.19427). Compatible with torch.compile.
Language: Python 0 2 00
fattorib/Python-Unigram
Unigram tokenization algorithm in Python
Language: Python 0 1 01
fattorib/CudaSoftmax
Softmax CUDA kernel :)
Language: Cuda 1 0
fattorib/fattorib.github.io
Website
Language: HTML 0 0
fattorib/Fundamental-Domain
Code to generate a section of the fundamental domain for the action of the special linear group on the space of (integral) binary cubic forms. The code is currently quite inefficient; I hope to optimize it in the future.
Language: MATLAB 1 0
fattorib/InfoGAN-Jax
InfoGAN in JAX with a small Gradio app
Language: Python 1 0
fattorib/jaxvae
Variational Autoencoder in JAX
Language: Python 1 0
fattorib/lm-evaluation-harness
Fork of lm-evaluation-harness for evaluating my custom models
Language: Python 0 0
fattorib/Python-BPE
I wrote Byte-Pair Encoding, but it's 600x slower than 🤗
Language: Python 1 0
fattorib/ResNets-CIFAR10
PyTorch implementation of the CIFAR10 ResNets, based on Deep Residual Learning for Image Recognition (He et al.)
Language: Python 1 0
