KellerJordan

Berkeley, California

Pinned Repositories

Autoencoder-Clustering
Replication of "Auto-encoder Based Data Clustering" Song et al
Language:Jupyter Notebook26 4 08
CapsNet-Adversarial
Capsule networks can defend against adversarial attacks using reconstruction error
Language:Jupyter Notebook13 2 12
cifar10-airbench
94% on CIFAR-10 in 2.6 seconds 💨 96% in 27 seconds
Language:Python178 2 09
Evaluate-CrossMax-Ensemble
An evaluation of the robust accuracy of the CrossMax Ensemble technique (Fort et al., 2024)
Language:Python0 1 00
modded-nanogpt
NanoGPT (124M) in 5 minutes
Language:Python1.5k 20 17122
Muon
Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead
Language:Python1213
REPAIR
Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair
Language:Jupyter Notebook45 2 17
ResNet-PyTorch-CIFAR10
PyTorch implementation of residual networks trained on CIFAR-10 dataset (2017)
Language:Python31 3 17
TriMap-PyTorch
Implementation of TriMap dimensionality reduction in PyTorch
Language:Python16 2 04
tSNE-Animation
Hacking sklearn's t-SNE implementation to animate embedding process
Language:Python53 3 18

KellerJordan's Repositories

KellerJordan/modded-nanogpt
NanoGPT (124M) in 5 minutes
Language:Python1.5k 20 17122
KellerJordan/cifar10-airbench
94% on CIFAR-10 in 2.6 seconds 💨 96% in 27 seconds
Language:Python178 2 09
KellerJordan/Muon
Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead
Language:Python1213
KellerJordan/REPAIR
Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair
Language:Jupyter Notebook45 2 17
KellerJordan/hlb-CIFAR10
Train to 94% on CIFAR-10 in 4.4 seconds on a single A100
Language:Python12 1 01
KellerJordan/top-sgd
Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)
Language:Python12 1 02
KellerJordan/cifar10-loader
Fast and easy to use CIFAR-10 dataloader
Language:Python6 1 00
KellerJordan/Exponentiated-Gradient-PyTorch
EG plus/minus optimizer implemented in PyTorch
Language:Python3 3 10
KellerJordan/CIFAR-cuda
welcome to the learning zone
Language:Cuda2
KellerJordan/gpt-sandbox
Language:Python2
KellerJordan/elastic-airbench
Language:Python1 2 0
KellerJordan/negative-self-influence
neural networks don't minimize loss [caution: probably due to batchnorm]
Language:Python1 2 0
KellerJordan/research-airbench
Variant of cifar10-airbench which removes several tricks. Ideal for research
Language:Python1 2 0
KellerJordan/CIFAR10-isolated-rng
CIFAR-10 training script with separate seeds for model initialization, data ordering, and data augmentation
Language:Python0 2 00
KellerJordan/Evaluate-CrossMax-Ensemble
An evaluation of the robust accuracy of the CrossMax Ensemble technique (Fort et al., 2024)
Language:Python0 1 00
KellerJordan/pixelated-features-bugs
Code for training and evaluating the robustness of models using pixelated data
Language:Python0 1 00
KellerJordan/BatchNorm-adaptation-behavior
The adaptation behavior of BatchNorm is no different than Norm-Free
Language:Python
KellerJordan/ffcv-cifar
Train a large number of CIFAR-10 models using FFCV
Language:Python2 0
KellerJordan/ffcv-imagenet
Train ImageNet *fast* in 500 lines of code with FFCV -- forked for training only Resnet18s
Language:Python1 0
KellerJordan/flash-attention
Fast and memory-efficient exact attention
Language:Python1 0
KellerJordan/git-re-basin
Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"
Language:Python1 0
KellerJordan/jupyter-fork
Language:Python
KellerJordan/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda0 0
KellerJordan/Megatron-LM
Ongoing research training transformer models at scale
Language:Python1 0
KellerJordan/MNIST-test
Language:Python
KellerJordan/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python0 0
KellerJordan/robustness-featsnotbugs
A replication of "Adversarial Examples Are Not Bugs, They Are Features" https://arxiv.org/abs/1905.02175
Language:Jupyter Notebook0 0
KellerJordan/share-data
Language:Python1 0
KellerJordan/trak
A fast, effective data attribution method for neural networks in PyTorch
Language:Python0 0
KellerJordan/zf-compiler
Language:Jupyter Notebook