scott-gray's Stars
keras-team/keras
Deep Learning for humans
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
apache/tvm
Open deep learning compiler stack for CPU, GPU, and specialized accelerators
karpathy/convnetjs
Deep Learning in JavaScript. Train Convolutional Neural Networks (or ordinary ones) in your browser.
openai/DALL-E
PyTorch package for the discrete VAE used for DALL·E.
plaidml/plaidml
PlaidML is a framework for making deep learning work everywhere.
NervanaSystems/neon
Intel® Nervana™ reference deep learning framework committed to best performance on all hardware
NVIDIA/nccl
Optimized primitives for collective multi-GPU communication
soumith/convnet-benchmarks
Easy benchmarking of all publicly accessible implementations of convnets
inducer/pycuda
CUDA integration for Python, plus shiny features
Maratyszcza/NNPACK
Acceleration package for neural networks on multi-core CPUs
dmlc/nnvm
lebedov/scikit-cuda
Python interface to GPU-powered libraries
likejazz/llama3.np
llama3.np is a pure NumPy implementation of the Llama 3 model.
NervanaSystems/maxas
Assembler for NVIDIA Maxwell architecture
joschu/cgt
Computation Graph Toolkit
andravin/wincnn
Winograd minimal convolution algorithm generator for convolutional neural networks.
daadaada/turingas
Assembler for NVIDIA Volta and Turing GPUs
eBay/maxDNN
High Efficiency Convolution Kernel for Maxwell GPU Architecture
KrzysztofHajdamowicz/nami-viper
Various things I have learned about Nami Burn-E Viper electric scooter