Pinned Repositories
DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
DINet-UI
Windows Forms user interface for making lip sync videos with DINet and OpenFace
LLaVA-Windows
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.
magic-animate
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
OnlySpeakTTS
Slight modifications to the Tortoise API, plus 3 additional scripts to make using Tortoise easier. Less focus on cloning makes speech generation much faster by default.
PiperUI
A UI for the Piper TTS
ProjectFiles
Where I will be storing misc files with details / links used during the installation process, etc
tortoise-WebUI
A multi-voice TTS system trained with an emphasis on quality
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Wav2Lip-WebUI
A wav2lip Web UI using Gradio
natlamir's Repositories
natlamir/PiperUI
A UI for the Piper TTS
natlamir/Wav2Lip-WebUI
A wav2lip Web UI using Gradio
natlamir/DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
natlamir/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
natlamir/DINet-UI
Windows Forms user interface for making lip sync videos with DINet and OpenFace
natlamir/tortoise-WebUI
A multi-voice TTS system trained with an emphasis on quality
natlamir/LLaVA-Windows
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.
natlamir/ProjectFiles
Where I will be storing misc files with details / links used during the installation process, etc
natlamir/OnlySpeakTTS
Slight modifications to the Tortoise API, plus 3 additional scripts to make using Tortoise easier. Less focus on cloning makes speech generation much faster by default.
natlamir/LipStick
A virtual makeup that will make faces look radiant! Get rid of that ugly face mask box on your videos: Get your magic Lipstick now!
natlamir/magic-animate
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
natlamir/AudioSep
Implementation of "Separate Anything You Describe"
natlamir/MeloTTS-Windows
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.
natlamir/sd-wav2lip-uhq
Wav2Lip UHQ extension for Automatic1111
natlamir/tpsm
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
natlamir/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
natlamir/zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
natlamir/PixArt-alpha
Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
natlamir/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
natlamir/SdPaint
Stable Diffusion Painting
natlamir/vid2densepose
Convert your videos to densepose and use it on MagicAnimate
natlamir/a11
Stable Diffusion web UI
natlamir/audio-webui
A webui for different audio related Neural Networks
natlamir/bark
🔊 Text-Prompted Generative Audio Model
natlamir/fabric-server
natlamir/OogaBooga
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
natlamir/piper
A fast, local neural text to speech system
natlamir/Show-1
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
natlamir/StabilityMatrix
Multi-Platform Package Manager for Stable Diffusion
natlamir/biniou
A self-hosted webui for 30+ generative AI