sunkinux's Stars
open-mmlab/mim
MIM Installs OpenMMLab Packages
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
lquesada/ComfyUI-Inpaint-CropAndStitch
ComfyUI nodes to crop before sampling and stitch back after sampling that speed up inpainting
DepthAnything/Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
xuebinqin/DIS
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
liewjunhao/thin-object-selection
Deep Interactive Thin Object Selection
gligen/GLIGEN
Open-Set Grounded Text-to-Image Generation
OptimalScale/DetGPT
imgproxy/imgproxy
Fast and secure standalone server for resizing and converting remote images
fatedier/frp
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
logtd/ComfyUI-Fluxtapoz
Nodes for image juxtaposition for Flux in ComfyUI
xinntao/Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
ckkelvinchan/RealBasicVSR
Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"
shadowcz007/comfyui-mixlab-nodes
Workflow-to-APP、ScreenShare&FloatingVideo、GPT & 3D、SpeechRecognition&TTS
lobehub/lobe-vidol
🧸 Lobe Vidol - Making Virtual Idols Accessible for EveryOne
papulke/face-of-art
Code for "The Face of Art: Landmark Detection and Geometric Style in Portraits"
yzhou359/MakeItTalk
pkhungurn/talking-head-anime-4-demo
Demo Programs for the "Talking Head(?) Anime from a Single Image 4: Improved Models and Its Distillation" Project
GiusTex/ComfyUI-DiffusersImageOutpaint
Diffusers Image Outpaint for ComfyUI
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
swordswind/ai_virtual_mate_web
AI虚拟伙伴Web版
nladuo/live2d-chatbot-demo
A live2D chatbot Demo build with python and js.
kleinlee/DH_live
每个人都能用的数字人
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
AIFSH/ComfyUI-Hallo
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
kyutai-labs/moshi
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
ossrs/srs
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
lipku/LiveTalking
Real time interactive streaming digital human