cduk's Stars
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
facefusion/facefusion
Industry-leading face manipulation platform
fishaudio/fish-speech
SOTA Open Source TTS
KwaiVGI/LivePortrait
Bring portraits to life!
voideditor/void
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
leejet/stable-diffusion.cpp
Stable Diffusion and Flux in pure C/C++
prabirshrestha/vim-lsp
async language server protocol plugin for vim and neovim
mishushakov/llm-scraper
Turn any webpage into structured data using LLMs
codelion/optillm
Optimizing inference proxy for LLMs
pytorch/ao
PyTorch native quantization and sparsity for training and inference
pentacent/keila
Open Source Newsletter Tool.
mattn/vim-lsp-settings
Automatic language server configurations for vim-lsp
matatonic/openedai-speech
An OpenAI API-compatible text-to-speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
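Because the server advertises OpenAI API compatibility, a request shaped like OpenAI's /v1/audio/speech endpoint should work against it. The sketch below is an assumption-laden example, not taken from the openedai-speech docs: the host, port, model name, and voice name are placeholders.

```python
# Minimal sketch of a request to an OpenAI-compatible TTS endpoint.
# Host, port, model, and voice are assumptions, not openedai-speech defaults.
import requests

resp = requests.post(
    "http://localhost:8000/v1/audio/speech",   # assumed base URL of a local instance
    json={
        "model": "tts-1",                       # name the server maps to xtts_v2 or piper
        "input": "Hello from an OpenAI-compatible TTS server.",
        "voice": "alloy",                       # placeholder voice name
    },
    timeout=60,
)
resp.raise_for_status()
with open("speech.mp3", "wb") as f:
    f.write(resp.content)                       # write the returned audio to disk
```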
BretFisher/docker-vackup
Script to easily backup and restore docker volumes
facefusion/facefusion-docker
Industry-leading face manipulation platform
TheBlewish/Web-LLM-Assistant-Llamacpp-Ollama
A Python-based web-assisted large language model (LLM) search assistant using Llama.cpp
facefusion/facefusion-assets
Industry-leading face manipulation platform
Agent-Tools/awesome-autonomous-web
CubicalBatch/deaddit
What if Reddit's content was completely AI-generated?
ModelCloud/GPTQModel
Production-ready LLM compression/quantization toolkit with accelerated inference support for both CPU and GPU via HF, vLLM, and SGLang.
BorealisAI/flora-opt
The official repository for the ICML 2024 paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors".
perk11/large-model-proxy
Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LMs) on the same machine with a limited amount of VRAM and other resources. It listens on a dedicated port for each proxied LM, making the models always available to clients connecting to those ports.
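As a rough illustration of the per-port idea described above (and nothing more: the actual tool also manages starting and stopping the backing models to fit the available resources, which this sketch omits), a minimal asyncio TCP forwarder with hypothetical port-to-backend mappings might look like this:

```python
# Conceptual sketch only, not large-model-proxy's implementation:
# each listening port forwards raw bytes to an assumed backend model server.
import asyncio

# Hypothetical mapping: proxy port -> (host, port) of a model server.
ROUTES = {
    5001: ("127.0.0.1", 8001),
    5002: ("127.0.0.1", 8002),
}

async def pipe(reader, writer):
    # Copy bytes one way until the peer closes the connection.
    try:
        while data := await reader.read(65536):
            writer.write(data)
            await writer.drain()
    finally:
        writer.close()

async def handle(listen_port, client_reader, client_writer):
    # Connect to the backend assigned to this listening port and relay both directions.
    backend_host, backend_port = ROUTES[listen_port]
    backend_reader, backend_writer = await asyncio.open_connection(backend_host, backend_port)
    await asyncio.gather(
        pipe(client_reader, backend_writer),
        pipe(backend_reader, client_writer),
    )

async def main():
    servers = [
        await asyncio.start_server(
            lambda r, w, p=port: handle(p, r, w), "0.0.0.0", port)
        for port in ROUTES
    ]
    await asyncio.gather(*(s.serve_forever() for s in servers))

asyncio.run(main())
```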
sasha0552/nvidia-pstated
A daemon that automatically manages the performance states of NVIDIA GPUs.
michaelfeil/embed
A stable, fast and easy-to-use inference library with a focus on a sync-to-async API
cduk/vllm-pascal
A fork of vLLM that enables support for Pascal-architecture GPUs
diegovelilla/reddit-omni
AI Reddit bot that scrapes subreddits for questions, conducts research, and posts automated answers to help users with relevant information.
merefield/discourse-frotz
A plugin that uses Frotz to give you an interactive fiction experience on your Discourse forum
the-crypt-keeper/llama-srb-api
Single Request Batching API server backed by llama.cpp