Pinned Repositories
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
apollo-kotlin-tutorial
The code for the Apollo Kotlin Tutorial
arxiv-sanity-lite
arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors based on paper abstracts.
aTENNuate
babyagi
BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
cog
Containers for machine learning
cog-HierSpeechpp
Cog wrapper for HierSpeech++
ComfyUI-AdvancedLivePortrait
kunibald413's Repositories
kunibald413/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
kunibald413/aTENNuate
kunibald413/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
kunibald413/ComfyUI-AdvancedLivePortrait
kunibald413/ComfyUI-Docker
🐳Dockerfile for 🎨ComfyUI. | 容器镜像与启动脚本
kunibald413/coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
kunibald413/cupy
NumPy & SciPy for GPU
kunibald413/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
kunibald413/e2-tts-pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
kunibald413/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
kunibald413/FasterLivePortrait
Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!
kunibald413/fasthtml
The fastest way to create an HTML app
kunibald413/fish-speech
Brand new TTS solution
kunibald413/flux
Official inference repo for FLUX.1 models
kunibald413/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
kunibald413/LivePortrait
Bring portraits to life!
kunibald413/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
kunibald413/neural_net_checklist
kunibald413/nocode
The best way to write secure and reliable applications. Write nothing; deploy nowhere.
kunibald413/parler-tts
Inference and training library for high-quality TTS models.
kunibald413/RAVSS
kunibald413/SenseVoice
Multilingual Voice Understanding Model
kunibald413/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
kunibald413/tinygrad-vit
A minimalist implementation of the ViT (Vision Transformer) model, using tinygrad
kunibald413/Train_Hifigan_XTTS
This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.
kunibald413/TTS-arxiv-daily
Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)
kunibald413/USpeech
official implementation of USpeech: Ultrasound-Enhanced Speech with Minimal Human Effort via Cross-Modal Synthesis
kunibald413/vall-e-ecker
An unofficial PyTorch implementation of VALL-E
kunibald413/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
kunibald413/YoutubePlaylistDownloader
A tool to download whole playlists, channels or single videos from youtube and also optionally convert them to almost any format you would like