cangoksen

MicrosoftSeatlle, WA

cangoksen's Stars

lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python37.3k 352 1.8k4.6k
karpathy/LLM101n
LLM101n: Let's build a Storyteller
30.6k 2.5k 01.7k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python14.8k 123 1.2k1.4k
dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
10.4k 876 6621
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Language:Python8.9k 97 1812k
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Language:Python8.1k 98 1.7k997
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Language:Python6.7k 65 84374
NVIDIA/cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
Language:C6.6k 121 2461.9k
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++5.9k 109 1.2k1k
linux-surface/linux-surface
Linux Kernel for Surface Devices
Language:Shell5.3k 127 1.3k227
microsoft/Codex-CLI
CLI tool that uses Codex to turn natural language commands into their Bash/ZShell/PowerShell equivalents
Language:Python2k 34 85183
openai/blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
Language:Cuda1k 197 48202
davidtvs/pytorch-lr-finder
A learning rate range test implementation in PyTorch
Language:Python934 14 61121
okankop/Efficient-3DCNNs
PyTorch Implementation of "Resource Efficient 3D Convolutional Neural Networks", codes and pretrained models.
Language:Python783 14 44152
lucidrains/linear-attention-transformer
Transformer based on a variant of attention that is linear complexity in respect to sequence length
Language:Python714 13 2068
tomaarsen/attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
Language:Python683 11 3040
LeapLabTHU/Agent-Attention
Official repository of Agent Attention (ECCV2024)
Language:Python567 4 4638
lucidrains/local-attention
An implementation of local windowed attention for language modeling
Language:Python396 5 1941
tmabraham/diffusion_reading_group
Diffusion Reading Group at EleutherAI
Language:Jupyter Notebook315 28 117
prprbr/awesome-lifelong-continual-learning
A list of papers, blogs, datasets and software in the field of lifelong/continual machine learning
282 16 144
facebookresearch/SpinQuant
Code repo for the paper "SpinQuant LLM quantization with learned rotations"
Language:Python190 8 2018
pprp/Awesome-LLM-Quantization
Awesome list for LLM quantization
Language:Python135 7 010
lezcano/expRNN
Optimization with orthogonal constraints and on general manifolds
Language:Python126 6 521
Dao-AILab/fast-hadamard-transform
Fast Hadamard transform in CUDA, with a PyTorch interface
Language:C124 4 617
AlexanderMath/fasth
Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.
Language:Python70 4 410
VaticanCameos99/knowledge-distillation-for-unet
An implementation of Knowledge distillation for segmentation, to train a small (student) UNet from a larger (teacher) UNet thereby reducing the size of the network while achieving performance similar to the heavier model.
Language:Python51 2 213
MattShannon/bandmat
A banded matrix library for python.
Language:Python26 6 109
hhb072/OrthogonalTransformer
Language:Python10 3 20
cdluminate/cdluminate
Language:TeX5 2 10
Goutam-Kelam/LayerOut
A new regularization technique that freezes the layers of the deep neural networks stochastically.
Language:Python4 2 00

cangoksen

cangoksen's Stars

lm-sys/FastChat

karpathy/LLM101n

Dao-AILab/flash-attention

dair-ai/ML-Papers-of-the-Week

jadore801120/attention-is-all-you-need-pytorch

huggingface/accelerate

mit-han-lab/streaming-llm

NVIDIA/cuda-samples

NVIDIA/cutlass

linux-surface/linux-surface

microsoft/Codex-CLI

openai/blocksparse

davidtvs/pytorch-lr-finder

okankop/Efficient-3DCNNs

lucidrains/linear-attention-transformer

tomaarsen/attention_sinks

LeapLabTHU/Agent-Attention

lucidrains/local-attention

tmabraham/diffusion_reading_group

prprbr/awesome-lifelong-continual-learning

facebookresearch/SpinQuant

pprp/Awesome-LLM-Quantization

lezcano/expRNN

Dao-AILab/fast-hadamard-transform

AlexanderMath/fasth

VaticanCameos99/knowledge-distillation-for-unet

MattShannon/bandmat

hhb072/OrthogonalTransformer

cdluminate/cdluminate

Goutam-Kelam/LayerOut