natlamir

Pinned Repositories

DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
Language: Python 37 1 08
DINet-UI
Windows Forms user interface for making lip sync videos with DINet and OpenFace
Language: C# 24 3 14
LLaVA-Windows
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.
Language: Python 23 0 04
magic-animate
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Language: Python 8 0 02
OnlySpeakTTS
Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes speech generation much faster by default.
Language: Python 10 0 04
PiperUI
A UI for the Piper TTS
Language: C# 67 6 148
ProjectFiles
Where I will be storing misc files with details / links used during the installation process, etc
Language: Jupyter Notebook 11 2 03
tortoise-WebUI
A multi-voice TTS system trained with an emphasis on quality
Language: Jupyter Notebook 24 2 05
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Language: Python 35 1 010
Wav2Lip-WebUI
A wav2lip Web UI using Gradio
Language: Python 65 5 09

natlamir's Repositories

natlamir/PiperUI
A UI for the Piper TTS
Language: C# 67 6 148
natlamir/Wav2Lip-WebUI
A wav2lip Web UI using Gradio
Language: Python 65 5 09
natlamir/DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
Language: Python 37 1 08
natlamir/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Language: Python 35 1 010
natlamir/DINet-UI
Windows Forms user interface for making lip sync videos with DINet and OpenFace
Language: C# 24 3 14
natlamir/tortoise-WebUI
A multi-voice TTS system trained with an emphasis on quality
Language: Jupyter Notebook 24 2 05
natlamir/LLaVA-Windows
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.
Language: Python 23 0 04
natlamir/ProjectFiles
Where I will be storing misc files with details / links used during the installation process, etc
Language: Jupyter Notebook 11 2 03
natlamir/OnlySpeakTTS
Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes speech generation much faster by default.
Language: Python 10 0 04
natlamir/LipStick
A virtual makeup that will make faces look radiant! Get rid of that ugly face mask box on your videos: Get your magic Lipstick now!
Language: Python 93
natlamir/magic-animate
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Language: Python 8 0 02
natlamir/AudioSep
Implementation of "Separate Anything You Describe"
Language: Python 7 0 02
natlamir/MeloTTS-Windows
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Language: Python 6 0 01
natlamir/sd-wav2lip-uhq
Wav2Lip UHQ extension for Automatic1111
Language: Python 6 0 01
natlamir/tpsm
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
Language: Jupyter Notebook 5 1 0
natlamir/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python 4 0 01
natlamir/zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
Language: Python 4 0 0
natlamir/PixArt-alpha
Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Language: Python 3 0 0
natlamir/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Language: Python 3 0 0
natlamir/SdPaint
Stable Diffusion Painting
Language: Python 3 0 0
natlamir/vid2densepose
Convert your videos to densepose and use it on MagicAnimate
Language: Python 3 0 0
natlamir/a11
Stable Diffusion web UI
Language: Python 2 0 0
natlamir/audio-webui
A webui for different audio related Neural Networks
Language: Python 1 0 0
natlamir/bark
🔊 Text-Prompted Generative Audio Model
Language: Jupyter Notebook 1 0 0
natlamir/fabric-server
Language: HTML 1 1 0
natlamir/OogaBooga
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
Language: Python 1 0 01
natlamir/piper
A fast, local neural text to speech system
Language: C++ 1 0 0
natlamir/Show-1
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
Language: Python 1 0 0
natlamir/StabilityMatrix
Multi-Platform Package Manager for Stable Diffusion
Language: C# 1 0 0
natlamir/biniou
a self-hosted webui for 30+ generative ai