sunkinux

sunkinux's Stars

open-mmlab/mim
MIM Installs OpenMMLab Packages
Language:Python35968
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Language:Python6.7k670
lquesada/ComfyUI-Inpaint-CropAndStitch
ComfyUI nodes to crop before sampling and stitch back after sampling that speed up inpainting
Language:Python52234
DepthAnything/Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Language:Python4.7k418
xuebinqin/DIS
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
Language:Jupyter Notebook2.4k274
liewjunhao/thin-object-selection
Deep Interactive Thin Object Selection
Language:Python899
gligen/GLIGEN
Open-Set Grounded Text-to-Image Generation
Language:Python2.1k156
OptimalScale/DetGPT
Language:Jupyter Notebook76971
imgproxy/imgproxy
Fast and secure standalone server for resizing and converting remote images
Language:Go9.3k650
fatedier/frp
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
Language:Go91k13.8k
logtd/ComfyUI-Fluxtapoz
Nodes for image juxtaposition for Flux in ComfyUI
Language:Python1.1k46
xinntao/Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Language:Python29.8k3.7k
ckkelvinchan/RealBasicVSR
Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"
Language:Python951137
shadowcz007/comfyui-mixlab-nodes
Workflow-to-APP、ScreenShare&FloatingVideo、GPT & 3D、SpeechRecognition&TTS
Language:JavaScript1.5k96
lobehub/lobe-vidol
🧸 Lobe Vidol - Making Virtual Idols Accessible for EveryOne
Language:TypeScript64998
papulke/face-of-art
Code for "The Face of Art: Landmark Detection and Geometric Style in Portraits"
Language:Jupyter Notebook26531
yzhou359/MakeItTalk
Language:Jupyter Notebook994218
pkhungurn/talking-head-anime-4-demo
Demo Programs for the "Talking Head(?) Anime from a Single Image 4: Improved Models and Its Distillation" Project
Language:Python19721
GiusTex/ComfyUI-DiffusersImageOutpaint
Diffusers Image Outpaint for ComfyUI
Language:Python704
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
Language:C++5.1k566
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language:Python4.7k423
swordswind/ai_virtual_mate_web
AI虚拟伙伴Web版
Language:Python28445
nladuo/live2d-chatbot-demo
A live2D chatbot Demo build with python and js.
Language:Jupyter Notebook717
kleinlee/DH_live
每个人都能用的数字人
Language:Python1.1k231
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
Language:Python2.3k290
AIFSH/ComfyUI-Hallo
Language:Python29716
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language:Python8.3k1.1k
kyutai-labs/moshi
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Language:Python7.6k611
ossrs/srs
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
Language:C++26.4k5.4k
lipku/LiveTalking
Real time interactive streaming digital human
Language:Python4.7k694