Pinned Repositories
stable-diffusion-webui
Stable Diffusion web UI
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
rembg
Rembg is a tool to remove image backgrounds
segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
sd-webui-controlnet
WebUI extension for ControlNet
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
GPT-SoVITS
1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
hjj-lmx's Repositories
hjj-lmx doesn’t have any repositories yet.