dengxb

dengxb's Stars

TadasBaltrusaitis/OpenFace
OpenFace – a state-of-the-art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
Language: MATLAB, 6.9k stars, 1.8k forks
rlawjdghek/StableVITON
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
Language: Python, 973 stars, 149 forks
ali-vilab/AnyDoor
Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"
Language: Python, 3.9k stars, 359 forks
lllyasviel/Fooocus
Focus on prompting and generating
Language: Python, 40.5k stars, 5.7k forks
esbatmop/MNBVC
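MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, targeting the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.
3.4k stars, 233 forks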
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Python, 4.5k stars, 385 forks
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
Language: Python, 23.5k stars, 3.5k forks
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language: Python, 29k stars, 2.8k forks
reworkd/AgentGPT
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
Language: TypeScript, 31.5k stars, 9.2k forks
cumulo-autumn/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Language: Python, 9.5k stars, 682 forks
CVI-SZU/Linly
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets
Language: Python, 3k stars, 233 forks
Tencent/TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Language: Python, 1k stars, 140 forks
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Language: Python, 10.4k stars, 1.1k forks
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
14.4k stars, 968 forks
aigc-apps/sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.
Language: Python, 4.9k stars, 387 forks
geekyutao/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
Language: Jupyter Notebook, 6.4k stars, 525 forks
sail-sg/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
Language: Python, 3.3k stars, 188 forks
continue-revolution/sd-webui-segment-anything
Segment Anything for Stable Diffusion WebUI
Language: Python, 3.4k stars, 206 forks
CASIA-IVA-Lab/FastSAM
Fast Segment Anything
Language: Python, 7.4k stars, 693 forks
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Language: Python, 6.4k stars, 478 forks
openimsdk/open-im-server
IM Chat
Language: Go, 13.9k stars, 2.4k forks
continue-revolution/sd-webui-animatediff
AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI
Language: Python, 3.1k stars, 253 forks
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Language: Jupyter Notebook, 14.9k stars, 1.4k forks
iperov/DeepFaceLive
Real-time face swap for PC streaming or video calls
Language: Python, 26.3k stars, 4.5k forks
qiuyu96/CoDeF
[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Language: Python, 4.8k stars, 386 forks
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
Language: Python, 23k stars, 5.4k forks
s0md3v/roop
one-click face swap
Language: Python, 28.2k stars, 6.8k forks
camenduru/text-to-video-synthesis-colab
Text To Video Synthesis Colab
Language: Jupyter Notebook, 1.4k stars, 174 forks
ali-vilab/composer
Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"
1.5k stars, 48 forks
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Language: Python, 6.4k stars, 953 forks