xrsrke's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
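For flavor, a minimal decoder-only setup along the lines of the repo's README (the hyperparameter values here are illustrative, not recommendations):

```python
import torch
from x_transformers import TransformerWrapper, Decoder

# Decoder-only language model; dim/depth/heads are arbitrary toy values.
model = TransformerWrapper(
    num_tokens=20000,
    max_seq_len=1024,
    attn_layers=Decoder(dim=512, depth=6, heads=8),
)
logits = model(torch.randint(0, 20000, (1, 1024)))  # (1, 1024, 20000)
```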
pytorch/ao
PyTorch native quantization and sparsity for training and inference
mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
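The core trick is simple enough to sketch: migrate activation outliers into the weights with a per-channel scale s_j = max|X_j|^α / max|W_j|^(1−α), which leaves the layer's output mathematically unchanged. A minimal sketch of that idea (`smooth_scales` is a hypothetical helper, not the repo's API; α = 0.5 is the paper's default):

```python
import torch

def smooth_scales(act_amax, weight, alpha=0.5):
    # Per-input-channel smoothing factor from the SmoothQuant paper:
    # s_j = max|X_j|^alpha / max|W_j|^(1 - alpha)
    w_amax = weight.abs().amax(dim=0)  # per-input-channel max over output channels
    return (act_amax.pow(alpha) / w_amax.pow(1 - alpha)).clamp(min=1e-5)

# Toy example: X @ W.T with an outlier in activation channel 0.
X = torch.randn(4, 8); X[:, 0] *= 50.0   # activation outlier
W = torch.randn(16, 8)                   # Linear weight (out_features, in_features)
s = smooth_scales(X.abs().amax(dim=0), W)
X_s, W_s = X / s, W * s                  # equivalent, but X_s is easier to quantize
assert torch.allclose(X @ W.T, X_s @ W_s.T, atol=1e-3)
```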
pytorch/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
google-research/federated
A collection of Google research projects related to Federated Learning and Federated Analytics.
ironjr/grokfast
Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"
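The method fits in a few lines: keep an EMA of each parameter's gradient (the slow component) and add an amplified copy back before the optimizer step. A sketch of the EMA variant following the paper's algorithm (`alpha` and `lamb` defaults follow the paper's reported values, but check the repo):

```python
import torch

def gradfilter_ema(model, grads=None, alpha=0.98, lamb=2.0):
    # EMA low-pass filter over gradients; the amplified slow component
    # is added back into p.grad before optimizer.step().
    if grads is None:
        grads = {n: p.grad.detach().clone() for n, p in model.named_parameters()
                 if p.grad is not None}
    for n, p in model.named_parameters():
        if p.grad is not None:
            grads[n] = alpha * grads[n] + (1 - alpha) * p.grad.detach()
            p.grad = p.grad + lamb * grads[n]
    return grads

# Usage in a training loop (names are illustrative):
#   ema = None
#   loss.backward()
#   ema = gradfilter_ema(model, ema)   # between backward() and step()
#   optimizer.step(); optimizer.zero_grad()
```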
FranxYao/Long-Context-Data-Engineering
Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context"
apple/ml-sigma-reparam
usyd-fsalab/fp6_llm
Efficient GPU support for LLM inference with x-bit quantization (e.g., FP6, FP5).
pytorch-labs/float8_experimental
This repository contains the experimental PyTorch native float8 training UX
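The general recipe such float8 training builds on is dynamic scaling: rescale each tensor so its absolute max fills the format's range, cast, and carry the scale alongside. A minimal per-tensor sketch using the `torch.float8_e4m3fn` dtype (PyTorch ≥ 2.1; `to_float8` and `E4M3_MAX` are my own names, not this repo's actual UX):

```python
import torch

E4M3_MAX = 448.0  # largest finite value representable in torch.float8_e4m3fn

def to_float8(x: torch.Tensor):
    # Per-tensor dynamic scale so the max value maps near E4M3_MAX.
    scale = E4M3_MAX / x.abs().amax().clamp(min=1e-12)
    x_fp8 = (x * scale).clamp(-E4M3_MAX, E4M3_MAX).to(torch.float8_e4m3fn)
    return x_fp8, scale

x = torch.randn(4, 4)
x_fp8, scale = to_float8(x)
x_back = x_fp8.to(torch.float32) / scale   # dequantize for inspection
print((x - x_back).abs().max())            # small quantization error
```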
pytorch-labs/applied-ai
Applied AI experiments and examples for PyTorch
nbasyl/LLM-FP4
The official implementation of the EMNLP 2023 paper LLM-FP4
jundaf2/INT8-Flash-Attention-FMHA-Quantization
albanD/subclass_zoo
Qualcomm-AI-research/FP8-quantization
athms/mad-lab
A MAD laboratory to improve AI architecture designs 🧪
wimh966/outlier_suppression
The official PyTorch implementation of the NeurIPS 2022 (spotlight) paper "Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models"
ROCm/aotriton
Ahead of Time (AOT) Triton Math Library
google-deepmind/asyncdiloco
Qualcomm-AI-research/outlier-free-transformers
arogozhnikov/adamw_bfloat16
AdamW optimizer for bfloat16 models in PyTorch 🔥.
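Why a special AdamW for bfloat16 at all: with only 8 mantissa bits, bf16 silently drops updates much smaller than a parameter's magnitude, as the snippet below shows. Remedies in this vein keep extra compensation state or round stochastically; see the repo for its particular fix.

```python
import torch

# bfloat16 spacing near 1.0 is 2**-8 ≈ 0.0039, so a 1e-3 update vanishes.
w = torch.tensor(1.0, dtype=torch.bfloat16)
print(w + 1e-3 == w)  # tensor(True): the update rounds away entirely
```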
thu-ml/Jetfire-INT8Training
graphcore-research/pytorch-tensor-tracker
Flexibly track outputs and grad-outputs of torch.nn.Module.
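Under the hood this kind of tracker rests on PyTorch's hook machinery. A generic sketch of that mechanism (not this library's API), capturing per-module outputs and grad-outputs by name:

```python
import torch
import torch.nn as nn

def attach_trackers(model: nn.Module, store: dict):
    # Record each submodule's output and grad-output under its name.
    for name, mod in model.named_modules():
        if not name:  # skip the root module itself
            continue
        mod.register_forward_hook(
            lambda m, inp, out, n=name: store.setdefault(n, {}).update(out=out.detach()))
        mod.register_full_backward_hook(
            lambda m, gin, gout, n=name: store.setdefault(n, {}).update(grad_out=gout[0].detach()))

store = {}
net = nn.Sequential(nn.Linear(4, 4), nn.ReLU())
attach_trackers(net, store)
net(torch.randn(2, 4)).sum().backward()
print(store['0']['out'].shape, store['0']['grad_out'].shape)  # both (2, 4)
```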
AmericanPresidentJimmyCarter/test-torch-bfloat16-vit-training
honglu2875/thing
Catch your tensors in one program and quietly send them to another live Python session.
carsonpo/octomul
Reasonably fast (compared to cuBLAS) and relatively simple int8 tensor core GEMM
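The contract such a kernel implements is int8 × int8 with int32 accumulation. A pure-PyTorch reference that serves as a correctness oracle (not the repo's kernel; integer matmul runs on CPU):

```python
import torch

# Reference int8 GEMM: accumulate products in int32 to avoid overflow.
a = torch.randint(-128, 127, (64, 32), dtype=torch.int8)
b = torch.randint(-128, 127, (32, 64), dtype=torch.int8)
ref = a.to(torch.int32) @ b.to(torch.int32)   # (64, 64) int32 result
```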
huggingface/bench_cluster
Krishnateja244/Vanishing_Gradient
This repository helps in understanding the vanishing gradient problem through visualization
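A minimal version of the effect it visualizes: in a deep sigmoid MLP, per-layer gradient norms shrink toward the input, since each sigmoid contributes a derivative of at most 0.25.

```python
import torch
import torch.nn as nn

# 10 sigmoid blocks; gradient norms decay from the last layer to the first.
layers = [nn.Sequential(nn.Linear(32, 32), nn.Sigmoid()) for _ in range(10)]
net = nn.Sequential(*layers)
net(torch.randn(8, 32)).sum().backward()
for i, block in enumerate(net):
    g = block[0].weight.grad.norm().item()
    print(f"layer {i:2d} grad norm: {g:.2e}")  # norms shrink toward layer 0
```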