MonadKai's Stars
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
statsd/statsd
Daemon for easy but powerful stats aggregation
stas00/ml-engineering
Machine Learning Engineering Open Book
vladmandic/automatic
SD.Next: Advanced Implementation Generative Image Models
fastapi-users/fastapi-users
Ready-to-use and customizable users management for FastAPI
SchedMD/slurm
Slurm: A Highly Scalable Workload Manager
blmoistawinde/HarvestText
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
microsoft/aici
AICI: Prompts as (Wasm) Programs
kingjulio8238/Memary
The Open Source Memory Layer For Autonomous Agents
facebookincubator/submitit
Python 3.8+ toolbox for submitting jobs to Slurm
XAMPPRocky/octocrab
A modern, extensible GitHub API Client for Rust.
MDK8888/GPTFast
Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.
distantmagic/paddler
Stateful load balancer custom-tailored for llama.cpp 🏓🦙
mit-han-lab/distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
MooreThreads/torch_musa
torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics cards.
awolverp/cachebox
The fastest memoizing and caching Python library written in Rust.
huggingface/llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
codingonion/awesome-cuda-and-hpc
🔥🔥🔥 A collection of some awesome public CUDA, cuBLAS, TensorRT and High Performance Computing (HPC) projects.
HPMLL/BurstGPT
A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
Dao-AILab/fast-hadamard-transform
Fast Hadamard transform in CUDA, with a PyTorch interface
intel/llm-on-ray
Pretrain, finetune and serve LLMs on Intel platforms with Ray
wangsiping97/FastGEMV
High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.
allegro/allms
A versatile and powerful library designed to streamline the process of querying LLMs
MDK8888/vllmini
A minimal implementation of vllm.
Aaronhuang-778/SliM-LLM
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models
huggingface/pyo3-special-method-derive
Automatically derive Python dunder methods for your Rust code
huggingface/tei-gaudi
A blazing fast inference solution for text embeddings models
pykeio/speech-synthesis
Common Rust traits for speech synthesis