CaptainCodeAU's Stars
dotmet/chatgpt_webui
Build a WebUI of ChatGPT with multiple authentication methods using Gradio and revChatGPT
NovelAI/novelai-aspect-ratio-bucketing
Implementation of aspect ratio bucketing for training generative image models as described in: https://blog.novelai.net/novelai-improvements-on-stable-diffusion-e10d38db82ac
Lotayou/Face-Renovation
Official repository of the paper "HiFaceGAN: Face Renovation via Collaborative Suppression and Replenishment".
ageitgey/face_recognition
The world's simplest facial recognition api for Python and the command line
victorchall/EveryDream2trainer
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
ShivamShrirao/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
danielgatis/rembg
Rembg is a tool to remove images background
facebookresearch/AnimatedDrawings
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
openai/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
PlayVoice/lora-svc
singing voice change based on whisper, and lora for singing voice clone
PlayVoice/whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
OlaWod/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
sergree/matchering-cli
🎚️ Simple Matchering 2.0 Command Line Application
sergree/matchering
🎚️ Open Source Audio Matching and Mastering
zhangyongmao/VISinger2
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
CjangCjengh/TTSModels
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
bloodraven66/DeepForcedAligner
bloodraven66/sslAAI
codes for ICASSP submission
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
bloodraven66/asr_utils
bloodraven66/ICASSP_LIMMITS23
facebookresearch/hydra
Hydra is a framework for elegantly configuring complex applications
vanhauser-thc/thc-hydra
hydra
openvpi/DiffSinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
shivam-shukla/Speech-Dataset-in-Hindi-Language
Anjok07/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
voicepaw/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.