drzodozo's Stars
zsyOAOA/ResShift
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
sherlock-project/sherlock
Hunt down social media accounts by username across social networks
facefusion/facefusion
Industry leading face manipulation platform
Stability-AI/generative-models
Generative Models by Stability AI
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
modelscope/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
glucauze/sd-webui-faceswaplab
Extended faceswap extension for StableDiffusion web-ui with multiple faceswaps, inpainting, checkpoints, ....
kohya-ss/sd-scripts
derrian-distro/LoRA_Easy_Training_Scripts
A UI made in Pyside6 to make training LoRA/LoCon and other LoRA type models in sd-scripts easy
LituRout/PSLD
Posterior Sampling using Latent Diffusion
guoyww/AnimateDiff
Official implementation of AnimateDiff.
assafelovic/gpt-researcher
LLM based autonomous agent that conducts in-depth web research on any given topic
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
SizheAn/PanoHead
Code Repository for CVPR 2023 Paper "PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360 degree"
ParisNeo/lollms-webui
Lord of Large Language Models Web User Interface
hyprwm/Hyprland
Hyprland is an independent, highly customizable, dynamic tiling Wayland compositor that doesn't sacrifice on its looks.
152334H/tortoise-tts-fast
Fast TorToiSe inference (5x or your money back!)
PromtEngineer/localGPT
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
s0md3v/roop
one-click face swap
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
serp-ai/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
volotat/SD-CN-Animation
This script allows to automate video stylization task using StableDiffusion and ControlNet.
vijishmadhavan/UnpromptedControl
Remove unwanted objects and restore images without prompts, powered by ControlNet.
yuliskov/SmartTube
SmartTube - an advanced player for set-top boxes and tvs running Android OS
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.