banjuede's Stars
leptonai/gpud
kubernetes/kubernetes
Production-Grade Container Scheduling and Management
ray-project/kuberay
A toolkit to run Ray applications on Kubernetes
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
apple/ToolSandbox
karmada-io/karmada
Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration
coreweave/nccl-tests
NVIDIA NCCL Tests for Distributed Training
anthropics/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
mamba-org/mamba
The Fast Cross-Platform Package Manager
state-spaces/mamba
Mamba SSM architecture
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
TaskingAI/TaskingAI
The open source platform for AI-native application development.
microsoft/TaskWeaver
A code-first agent framework for seamlessly planning and executing data analytics tasks.
codykrieger/gfxCardStatus
gfxCardStatus is an open-source menu bar application that keeps track of which graphics card your unibody, dual-GPU MacBook Pro is using at any given time, and allows you to switch between them on demand.
punica-ai/punica
Serving multiple LoRA finetuned LLM as one
facebook/rocksdb
A library that provides an embeddable, persistent key-value store for fast storage.
facebookresearch/NeuralDB
Database Reasoning Over Text project for ACL paper
apache/fury
A blazingly fast multi-language serialization framework powered by JIT and zero-copy.
notion-enhancer/notion-enhancer
An enhancer/customiser for the all-in-one productivity workspace Notion
aimhubio/aim
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
cumulo-autumn/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Dao-AILab/flash-attention
Fast and memory-efficient exact attention