ztsweet's Stars
Lightning-AI/LitServe
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
zed-industries/zed
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
mitchellh/libxev
libxev is a cross-platform, high-performance event loop that provides abstractions for non-blocking IO, timers, events, and more and works on Linux (io_uring or epoll), macOS (kqueue), and Wasm + WASI. Available as both a Zig and C API.
cdgriffith/Box
Python dictionaries with advanced dot notation access
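Box's core idea is letting nested dict keys read as attributes. A minimal pure-Python sketch of that idea (the class name `DotDict` is invented for illustration; Box itself additionally handles key conversion, merging, frozen boxes, and more):

```python
class DotDict(dict):
    """Minimal dict subclass with attribute (dot) access to keys.

    Illustrative sketch only -- not python-box's implementation.
    """

    def __getattr__(self, name):
        try:
            value = self[name]
        except KeyError:
            raise AttributeError(name)
        # Wrap nested dicts on access so chains like cfg.server.port work.
        return DotDict(value) if isinstance(value, dict) else value

    def __setattr__(self, name, value):
        self[name] = value


cfg = DotDict({"server": {"host": "localhost", "port": 8080}})
print(cfg.server.port)  # 8080
cfg.debug = True
print(cfg["debug"])     # True
```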
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
ziglang/zig-spec
AdjectiveAllison/zvdb
Zig Vector Database!
fabioarnold/nanovg-zig
A small anti-aliased hardware-accelerated vector graphics library
gusye1234/nano-vectordb
A simple, easy-to-hack Vector Database
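Small vector databases like zvdb and nano-vectordb center on one operation: brute-force nearest-neighbor search over embedding vectors, typically by cosine similarity. A toy pure-Python sketch of that core (all names invented; real libraries add quantization, persistence, and smarter indexes):

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)


class TinyVectorDB:
    """Illustrative brute-force vector store, not a real library's API."""

    def __init__(self):
        self.items = []  # list of (id, vector) pairs

    def upsert(self, item_id, vector):
        self.items.append((item_id, vector))

    def query(self, vector, top_k=1):
        # Score every stored vector, return the top_k ids by similarity.
        scored = [(cosine(vector, v), i) for i, v in self.items]
        scored.sort(reverse=True)
        return [i for _, i in scored[:top_k]]


db = TinyVectorDB()
db.upsert("cat", [1.0, 0.0])
db.upsert("dog", [0.9, 0.1])
db.upsert("car", [0.0, 1.0])
print(db.query([1.0, 0.05], top_k=2))  # ['cat', 'dog']
```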
skyzh/mini-lsm
A tutorial of building an LSM-Tree storage engine in a week.
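The LSM-tree pattern the tutorial builds: writes land in an in-memory table; when it fills, it is frozen into an immutable sorted run on disk, and reads check the memtable first, then runs newest-to-oldest so later writes shadow older ones. A toy sketch of that flow (names invented; real engines add a WAL, bloom filters, and compaction):

```python
class ToyLSM:
    """Toy LSM-tree: memtable plus sorted immutable runs.

    Illustration only -- omits write-ahead log, bloom filters, compaction.
    """

    def __init__(self, memtable_limit=2):
        self.memtable = {}
        self.runs = []  # newest run last; each run is a sorted (key, value) list
        self.memtable_limit = memtable_limit

    def put(self, key, value):
        self.memtable[key] = value
        if len(self.memtable) >= self.memtable_limit:
            self._flush()

    def _flush(self):
        # Freeze the memtable into a sorted, immutable run.
        self.runs.append(sorted(self.memtable.items()))
        self.memtable = {}

    def get(self, key):
        if key in self.memtable:
            return self.memtable[key]
        # Search runs newest-first so later writes shadow older ones.
        for run in reversed(self.runs):
            for k, v in run:
                if k == key:
                    return v
        return None


db = ToyLSM()
db.put("a", 1)
db.put("b", 2)  # fills the memtable and triggers a flush
db.put("a", 3)  # newer value shadows the flushed one
print(db.get("a"))  # 3
print(db.get("b"))  # 2
```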
karthik-codex/Autogen_GraphRAG_Ollama
Microsoft's GraphRAG + AutoGen + Ollama + Chainlit = Fully Local & Free Multi-Agent RAG Superbot
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by the Qwen team at Alibaba Cloud.
gusye1234/nano-graphrag
A simple, easy-to-hack GraphRAG implementation
Hejsil/zig-clap
Command line argument parsing library
pytorch/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
aria2/aria2
aria2 is a lightweight, multi-protocol, multi-source, cross-platform command-line download utility. It supports HTTP/HTTPS, FTP, SFTP, BitTorrent, and Metalink.
quickwit-oss/quickwit
Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
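"Exact" here means FlashAttention returns the same result as standard scaled dot-product attention, softmax(QK^T / sqrt(d)) V, just computed in tiles so the full attention matrix is never materialized. A plain-Python sketch of the reference computation it matches (toy sizes, no tiling):

```python
import math


def softmax(xs):
    # Subtract the max for numerical stability before exponentiating.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]


def attention(Q, K, V):
    """Standard scaled dot-product attention over lists of row vectors.

    FlashAttention produces this exact output, computed block by block.
    """
    d = len(Q[0])
    out = []
    for q in Q:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in K]
        weights = softmax(scores)
        # Each output row is a convex combination of the value rows.
        row = [sum(w * v[j] for w, v in zip(weights, V)) for j in range(len(V[0]))]
        out.append(row)
    return out


Q = [[1.0, 0.0], [0.0, 1.0]]
K = [[1.0, 0.0], [0.0, 1.0]]
V = [[1.0, 2.0], [3.0, 4.0]]
print(attention(Q, K, V))
```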
EricLBuehler/candle-vllm
Efficient platform for inference and serving of local LLMs, including an OpenAI-compatible API server.
InfiniTensor/InfiniLM
LlamaEdge/LlamaEdge
The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge
bbycroft/llm-viz
3D visualization of a GPT-style LLM
CompendiumLabs/ziggy
Embedding, quantization, and vector indexing. Designed for speed.
tensorlakeai/indexify
A real-time serving engine for data-intensive generative AI applications
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
MistApproach/callm
Run Generative AI models directly on your hardware