Pinned Repositories
3D-Face-GCNs
Towards High-Fidelity 3D Face Reconstruction from In-the-Wild Images Using Graph Convolutional Networks, CVPR 2020
3DHM
Synthesizing Moving People with 3D Control
AI-Vtuber
AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain(本地/llm)/chatglm/text-generation-webui/闻达/文心一言】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/斗鱼/YouTube/twitch】 直播中与观众实时互动 或 直接在本地进行聊天。它使用文本转语音技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;通过特定指令协同SD进行画图。
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
AnimatedDrawings
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
apitable
🚀🎉📚 APITable, an API-oriented low-code platform for building collaborative apps and better than all other Airtable open-source alternatives.
GPT2-chitchat
GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想)
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
R_sentiment_analysis
基于哈工大词林词相似度分析R实现
jackstephen's Repositories
jackstephen/3DHM
Synthesizing Moving People with 3D Control
jackstephen/AI-Vtuber
AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain(本地/llm)/chatglm/text-generation-webui/闻达/文心一言】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/斗鱼/YouTube/twitch】 直播中与观众实时互动 或 直接在本地进行聊天。它使用文本转语音技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;通过特定指令协同SD进行画图。
jackstephen/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
jackstephen/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
jackstephen/awesome-virtual-try-on
A curated list of awesome research papers, projects, code, dataset, workshops etc. related to virtual try-on.
jackstephen/ComfyUI
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
jackstephen/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
jackstephen/facefusion
Next generation face swapper and enhancer
jackstephen/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
jackstephen/HeyGenClone
A simple and open-source analogue of the HeyGen system
jackstephen/HR-VITON
Official PyTorch implementation for the paper High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions (ECCV 2022).
jackstephen/langchain-ChatGLM
langchain-ChatGLM, local knowledge based ChatGLM with langchain | 基于本地知识库的 ChatGLM 问答
jackstephen/magic-animate
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
jackstephen/metahuman-stream
Real time streaming digital human based on nerf
jackstephen/NativeSpeaker
make your Speaker talking as Native style with own voice!
jackstephen/OutfitAnyone
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
jackstephen/PoseGPT
jackstephen/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
jackstephen/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
jackstephen/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
jackstephen/stable-diffusion-webui
Stable Diffusion web UI
jackstephen/StableVITON
jackstephen/street-tryon-benchmark
StreetTryOn: A Benchmark for In-the-Wild Virtual Try-On and Cross-Domain Virtual Try-On
jackstephen/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild
jackstephen/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
jackstephen/TTS-for-GPT-soVITS
这是一个简单的TTS后端项目 基于https://github.com/RVC-Boss/GPT-SoVITS 并提供了一些推理优化的特性/This is a simple TTS backend project based on https://github.com/RVC-Boss/GPT-SoVITS and provides some inference optimization features:
jackstephen/vid2densepose
Convert your videos to densepose and use it on MagicAnimate
jackstephen/VideoSwap
Code for VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
jackstephen/VITON-HD
Official PyTorch implementation of "VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization" (CVPR 2021)
jackstephen/VividTalk
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior