Pinned Repositories
donkey
self driving car
flash-attention
Fast and memory-efficient exact attention
Liger-Kernel
Efficient Triton Kernels for LLM Training
tensorflow
An Open Source Machine Learning Framework for Everyone
cupy
NumPy & SciPy for GPU
flash-attention
Fast and memory-efficient exact attention
xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Liger-Kernel
Efficient Triton Kernels for LLM Training
tensorflow
An Open Source Machine Learning Framework for Everyone
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
arvindsun's Repositories
arvindsun/donkey
self driving car
arvindsun/flash-attention
Fast and memory-efficient exact attention
arvindsun/Liger-Kernel
Efficient Triton Kernels for LLM Training
arvindsun/tensorflow
An Open Source Machine Learning Framework for Everyone