Pinned Repositories
3d-photo-inpainting-Windows
[CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting
animegan2-pytorch-Windows
PyTorch implementation of AnimeGANv2
ECON
ECON: Explicit Clothed humans Obtained from Normals (arXiv 2022)
FakeYou-Tacotron2-Notebook
Tacotron2 Training Notebook for FakeYou.com
so-vits-svc-4.0
SoftVC VITS Singing Voice Conversion
so-vits-svc-4.0-v2
SoftVC VITS Singing Voice Conversion
StyleFlow-Windows-10
StyleFlow: Attribute-conditioned Exploration of StyleGAN-generated Images using Conditional Continuous Normalizing Flows
TalkNET-colab
NVIDIA's TalkNET - Train and Synthesize on Colab
VQGAN-CLIP
VQGAN+CLIP Colab Notebook with user-friendly interface.
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
justinjohn0306's Repositories
justinjohn0306/FakeYou-Tacotron2-Notebook
Tacotron2 Training Notebook for FakeYou.com
justinjohn0306/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
justinjohn0306/MaskGCT-Windows
MaskGCT for Windows users
justinjohn0306/TalkNET-colab
NVIDIA's TalkNET - Train and Synthesize on Colab
justinjohn0306/SpeedScribe
High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback.
justinjohn0306/GPT-SoVITS-No-WebUI
Google Colab implementation of GPT-SoVITS without the WebUI
justinjohn0306/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
justinjohn0306/epitran
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
justinjohn0306/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
justinjohn0306/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
justinjohn0306/GPT-SoVITS
Even 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
justinjohn0306/GPT-Talker
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
justinjohn0306/GRM
Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
justinjohn0306/memo-windows
Memory-Guided Diffusion for Expressive Talking Video Generation
justinjohn0306/BHTwitter
Awesome tweak for Twitter
justinjohn0306/Efficient-Face2Vid-Portrait
justinjohn0306/FacePoke
Select a portrait, click to move the head around (please use your own space / GPU!)
justinjohn0306/FCLAFI
Fast Character Lossless Anime Video Frame Interpolation
justinjohn0306/GPT-SoVITS2
GPT-SoVITS2
justinjohn0306/LangSegment
A multilingual (97 languages) tool for automatic text recognition and segmentation; a powerful word segmenter for mixed-language text in TTS.
justinjohn0306/LivePortrait
Make one portrait alive!
justinjohn0306/phonemizer
Simple text to phones converter for multiple languages
justinjohn0306/ryujinx
Downstream fork of the Ryujinx project
justinjohn0306/SCInsta
A feature-rich tweak for Instagram on iOS!
justinjohn0306/Style-Bert-VITS2
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.
justinjohn0306/Unique3D
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
justinjohn0306/uYouEnhanced
uYouEnhanced is an expanded version of uYou+ (made by @qnblackcat) with additional features, made mainly for non-jailbroken users!
justinjohn0306/Whisper-AFE-TalkingHeadsGen
justinjohn0306/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
justinjohn0306/YTMusicUltimate
The best tweak for YouTube Music iOS.