p0p4k

medically diagnosed with imposter syndrome

Pinned Repositories

CoquiTTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python | 1 1 00
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Language: Python | 1 1 01
Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
Language: Jupyter Notebook | 57 10 35
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Language: Python | 10
pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
Language: Python | 194 15 4027
vits2_pytorch
Unofficial VITS2 TTS implementation in PyTorch
Language: Python | 460 25 5380
vits3_pytorch
Language: Python | 26 7 12
voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
Language: Python | 1 1 00

p0p4k's Repositories

p0p4k/vits2_pytorch
Unofficial VITS2 TTS implementation in PyTorch
Language: Python | 460 25 5380
p0p4k/pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
Language: Python | 194 15 4027
p0p4k/Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
Language: Jupyter Notebook | 57 10 35
p0p4k/vits3_pytorch
Language: Python | 26 7 12
p0p4k/CoquiTTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python | 1 1 00
p0p4k/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Language: Python | 1 1 01
p0p4k/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Language: Python | 10
p0p4k/voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
Language: Python | 1 1 00
p0p4k/g2pK
g2pK: g2p module for Korean
Language: Python | 0 1 00
p0p4k/g2pkk
Language: Python | 0 1 00
p0p4k/humble-gumbel
Jupyter notebook on Gumbel-max and Gumbel-softmax tricks
Language: Jupyter Notebook | 0 1 00
p0p4k/image-nilm
Language: Jupyter Notebook | 0 1 00
p0p4k/kss
Kss: A Toolkit for Korean sentence segmentation
Language: Python | 0 1 00
p0p4k/label-studio-converter
Tools for converting Label Studio annotations into common dataset formats
Language: Python | 0 1 00
p0p4k/MagneticData
MagWi + mobile dataset
2 01
p0p4k/Matcha-TTS
🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
p0p4k/ModifiedOpenLabelling
A modified version of the OpenLabeling tool (https://github.com/Cartucho/OpenLabeling)
Language: Python | 1 0
p0p4k/p0p4k.github.io
Language: SCSS
p0p4k/pits
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
Language: Python | 1 0
p0p4k/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language: Python | 1 0
p0p4k/python-inquirer
A collection of common interactive command line user interfaces, based on Inquirer.js (https://github.com/SBoudrias/Inquirer.js/)
Language: Python | 1 0
p0p4k/SMART-G2P
p0p4k/speechbrain
A PyTorch-based Speech Toolkit
Language: Python | 1 0
p0p4k/Tacotron-Korean-Tensorflow2
Language: Python | 1 0
p0p4k/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language: Python | 1 0
p0p4k/transformer-walkthrough
A walkthrough of transformer architecture code
Language: Jupyter Notebook | 1 0
p0p4k/Tree-Math
Math behind all the mainstream tree-based machine learning models
1 0
p0p4k/UJIdata
Data for UJI
p0p4k/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language: Python | 1 0
p0p4k/x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers