skrider

UC Berkeley EECS | ML infra

Tesla Autopilot

Pinned Repositories

6.824-labs
labs for MIT 6.824 distributed systems
Language:Go0 1 00
61c-matrix-tools
Tools for debugging the cs61c classify project
Language:Assembly0 1 00
async-recursion-monotonic-list-contrived
contrived example of an async recursive function with a monotonically growing immutable list
Language:Rust0 1 00
brain4d
Note management utility
0 1 00
braind
Utility for managing my markdown notes
Language:TypeScript0 1 00
cuda-bitonic-merge
Language:C++0 1 00
dotfiles
Language:Python0 1 00
flash-attention
Fast and memory-efficient exact attention
Language:Python2 0 00
softgrep
Code search with tree-sitter + semantic search
Language:Go3 1 00
speculative-forecasting
Experiments with controlling how many tokens to predict for speculative decoding
Language:Jupyter Notebook0 1 00

skrider's Repositories

skrider/softgrep
Code search with tree-sitter + semantic search
Language:Go3 1 00
skrider/flash-attention
Fast and memory-efficient exact attention
Language:Python2 0 00
skrider/6.824-labs
labs for MIT 6.824 distributed systems
Language:Go0 1 00
skrider/async-recursion-monotonic-list-contrived
contrived example of an async recursive function with a monotonically growing immutable list
Language:Rust0 1 00
skrider/brain4d
Note management utility
0 1 00
skrider/chat-with-gpt
An open-source ChatGPT app with a voice
Language:TypeScript0 0 00
skrider/crossgrep
Cross-encoding AST queries with LLMs for dense retrieval
Language:Rust0 0 00
skrider/cten
CUDA tensor library from scratch for practice
Language:C++0 1 00
skrider/cuda-bitonic-merge
Language:C++0 1 00
skrider/cuda-workshop
cuda-workshop
0 1 00
skrider/dotfiles
Language:Python0 1 00
skrider/draftsman
CS285 final project
Language:Jupyter Notebook00
skrider/serverless-model-example
Practice project demonstrating how to serve copies of a single model efficiently and autoscale on demand
Language:Go0 1 00
skrider/speculative-forecasting
Experiments with controlling how many tokens to predict for speculative decoding
Language:Jupyter Notebook0 1 00
skrider/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python0 0 00
skrider/candle
Minimalist ML framework for Rust
Language:Rust0 0
skrider/cs285-project
CS 285 Final Project
1 0
skrider/eecs182-midterm-review
eecs182-midterm-review
Language:Makefile1 01
skrider/flashinfer
FlashInfer: Kernel Library for LLM Serving
Language:Cuda0 0
skrider/kernel-introspection
Language:Jupyter Notebook1 0
skrider/kubernetes-examples
Kubernetes application example tutorials
Language:Shell0 0
skrider/labs
labs
1 0
skrider/mq-paged-attention
Mutli query paged attention kernel
1 0
skrider/paged_flash_attention_inference
Paged flash attention
skrider/pip-prune
Tool for automatically minimizing python dependencies as much as possible
Language:Go1 0
skrider/react-flow-onboard
Language:TypeScript1 0
skrider/resume
My resume template in latex
Language:TeX
skrider/serverless-sam
Deploying SAM on banana ml serverless
Language:Python1 0
skrider/serverless-scraper
Serverless web scraper + RAG backend
Language:Go1 0
skrider/torch-dynamo-experiments
Profiling models compiled with pytorch dynamo and inductor on a large body of models on an A10
Language:Python1 0