jiaqianjing's Stars
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
songquanpeng/one-api
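OpenAI API management and distribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot (Wenxin Yiyan), iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage and redistribute keys, ships as a single executable with a prebuilt Docker image, and deploys in one click, ready to use out of the box. It exposes a single API for all LLMs and features an English UI.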
fishaudio/fish-speech
Brand new TTS solution
davisking/dlib
A toolkit for making real world machine learning and data analysis applications in C++
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
intitni/CopilotForXcode
The first GitHub Copilot, Codeium and ChatGPT Xcode Source Editor Extension
Huanshere/VideoLingo
Netflix-level subtitle cutting, translation, alignment, and even dubbing: a one-click, fully automated AI video subtitle team
OwO-Network/DeepLX
Powerful Free DeepL API, No Token Required
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
RROrg/rr
Redpill Recovery (arpl-i18n)
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
AiuniAI/Unique3D
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
BadToBest/EchoMimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Fannovel16/comfyui_controlnet_aux
ComfyUI's ControlNet Auxiliary Preprocessors
xinsir6/ControlNetPlus
ControlNet++: All-in-one ControlNet for image generations and editing!
instantX-research/InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
zhulu111/ComfyUI_Bxb
SD变现宝 (SD Monetization Tool): convert ComfyUI workflows into mini programs with one click.
ali-vilab/MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
11cafe/comfyui-workspace-manager
A ComfyUI workflow and model management extension that organizes all your workflows and models in one place. Seamlessly switch between workflows, import and export workflows, reuse subworkflows, install models, and browse your models in a single workspace.
yolain/ComfyUI-Easy-Use
To make ComfyUI easier to use, I have optimized and integrated some commonly used nodes.
kuoruan/openwrt-frp
Frpc & Frps for OpenWrt
kijai/ComfyUI-FluxTrainer
Picsart-AI-Research/LIVE-Layerwise-Image-Vectorization
[CVPR 2022 Oral] Towards Layer-wise Image Vectorization
psychopasss/Synology-Lrc-Plugin-For-QQ-Music
Lyrics plugin for Synology Audio Station/DS Audio, powered by QQ Music 🙂
siliconflow/BizyAir
BizyAir: Comfy Nodes that can run in any environment.