Qubitium
Golang, Python, Kotlin, Swift. I prefer strongly typed languages and I do not worship PEP. @ModelCloudAi
ModelCloud.aiEarth/Epoch 2.0
Qubitium's Stars
Aider-AI/aider
aider is AI pair programming in your terminal
allegroai/clearml
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
microsoft/TinyTroupe
LLM-powered multiagent persona simulation for imagination enhancement and business insights.
intel/intel-extension-for-pytorch
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
evalplus/evalplus
Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
ashvardanian/SimSIMD
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
trotsky1997/MathBlackBox
mit-han-lab/nunchaku
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
makeecat/Peng
A minimal quadrotor autonomy framework in Rust (Mac, Linux, Windows)
AGI-Arena/MARS
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
spcl/QuaRot
Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.
bigcode-project/bigcodebench
BigCodeBench: Benchmarking Code Generation Towards AGI
KellerJordan/Muon
Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead
ModelCloud/GPTQModel
Production ready LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.
haonan3/AnchorContext
AnchorAttention: Improved attention for LLMs long-context training
THUDM/Android-Lab
bloc97/DeMo
DeMo: Decoupled Momentum Optimization
Jellyfish042/Sudoku-RWKV
xdit-project/mochi-xdit
faster parallel inference of mochi-1 video generation model
amd/ZenDNN
huggingface/optimum-graphcore
Blazing fast training of 🤗 Transformers on Graphcore IPUs
ModelTC/Outlier_Suppression_Plus
Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
gorse-io/goat
Go assembly transpiler for C programming language
gonglinyuan/safim
MooreThreads/mutlass
MUSA Templates for Linear Algebra Subroutines
graphcore/Gradient-HuggingFace
Tasks and tutorials using Graphore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace
ModelCloud/Device-SMI
Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separate tools such as nvidia-smi or /proc/cpuinfo and parsing it yourself.
IST-DASLab/ISTA-DASLab-Optimizers
waefrebeorn/GradRetentionNet
GradRetentionNet: A Research Project Exploring Persistent Gradient Descent for Improved Global Optimization
NonvolatileMemory/fast_llm_sampling
fast sampling from categorical distribution