mobicham

Mobius Labs GmbHBerlin, Germany

Pinned Repositories

transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python141k 1.1k 16.9k28.3k
ao
PyTorch native quantization and sparsity for training and inference
Language:Python00
gemlite
Fast low-bit matmul kernels in Triton
Language:Python00
hqq
Official implementation of Half-Quadratic Quantization (HQQ)
Language:Python00
sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python0 0 00
gemlite
Fast low-bit matmul kernels in Triton
Language:Python263 8 1321
hqq
Official implementation of Half-Quadratic Quantization (HQQ)
Language:Python765 16 12279
low-rank-llama2
Low-Rank Llama Custom Training
Language:Python22 4 21
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python88k 1.8k 49.6k23.6k
triton
Development repository for the Triton language and compiler
Language:MLIR14.9k 197 1.7k1.9k

mobicham's Repositories

mobicham/ao
PyTorch native quantization and sparsity for training and inference
Language:Python00
mobicham/gemlite
Fast low-bit matmul kernels in Triton
Language:Python00
mobicham/hqq
Official implementation of Half-Quadratic Quantization (HQQ)
Language:Python00
mobicham/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python0 0 00