Pinned Repositories
CoquiTTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
vits2_pytorch
unofficial vits2-TTS implementation in pytorch
vits3_pytorch
voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
p0p4k's Repositories
p0p4k/vits2_pytorch
unofficial vits2-TTS implementation in pytorch
p0p4k/pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
p0p4k/Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
p0p4k/vits3_pytorch
p0p4k/CoquiTTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
p0p4k/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
p0p4k/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
p0p4k/voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
p0p4k/g2pK
g2pK: g2p module for Korean
p0p4k/g2pkk
p0p4k/humble-gumbel
Jupyter notebook on Gumbel-max and Gumbel-softmax tricks
p0p4k/image-nilm
p0p4k/kss
Kss: A Toolkit for Korean sentence segmentation
p0p4k/label-studio-converter
Tools for converting Label Studio annotations into common dataset formats
p0p4k/MagneticData
MagWi + mobile dataset
p0p4k/Matcha-TTS
🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
p0p4k/ModifiedOpenLabelling
A modified version of https://github.com/Cartucho/OpenLabeling OpenLabelling tool
p0p4k/p0p4k.github.io
p0p4k/pits
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
p0p4k/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
p0p4k/python-inquirer
A collection of common interactive command line user interfaces, based on Inquirer.js (https://github.com/SBoudrias/Inquirer.js/)
p0p4k/SMART-G2P
p0p4k/speechbrain
A PyTorch-based Speech Toolkit
p0p4k/Tacotron-Korean-Tensorflow2
p0p4k/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
p0p4k/transformer-walkthrough
A walkthrough of transformer architecture code
p0p4k/Tree-Math
Math behind all the mainstream tree-based machine learning models
p0p4k/UJIdata
Data for UJI
p0p4k/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
p0p4k/x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers