Pinned Repositories
cuda-utils
CUDA utilties/helpers for simplifying multidimensional indexing
ffvec
haystack-embedded
haystackdb
megablock
Mega🅱️lock
octomul
Reasonably fast (compared to cublas) and relatively simple int8 tensor core gemm
Paul-Graham-Essays
quadmul
a fast and customizable CUDA int4 tensor core gemm
safetensors.cpp
Zero Dependency LibTorch Safetensors Loading and Storing in C++
TF2-HRNet
carsonpo's Repositories
carsonpo/haystackdb
carsonpo/megablock
Mega🅱️lock
carsonpo/ffvec
carsonpo/safetensors.cpp
Zero Dependency LibTorch Safetensors Loading and Storing in C++
carsonpo/haystack-embedded
carsonpo/octomul
Reasonably fast (compared to cublas) and relatively simple int8 tensor core gemm
carsonpo/TF2-HRNet
carsonpo/Paul-Graham-Essays
carsonpo/quadmul
a fast and customizable CUDA int4 tensor core gemm
carsonpo/cuda-utils
CUDA utilties/helpers for simplifying multidimensional indexing
carsonpo/octoquadmul
carsonpo/whisper-turbo
Cross-Platform, GPU Accelerated Whisper 🏎️
carsonpo/arxivist
carsonpo/candle
Minimalist ML framework for Rust
carsonpo/FeedbackButton
Stripe-esque Feedback Button
carsonpo/flambientgan
carsonpo/TwilightGAN
carsonpo/WebGPT
Run GPT model on the browser with WebGPU. An implementation of GPT inference in less than ~2000 lines of vanilla Javascript.
carsonpo/ratchet
A cross-platform browser ML framework.