SoonbeomChoi

MACLab

SoonbeomChoi's Stars

sammccord/solid-pixi
Create PIXI applications with JSX and Signals
Language:TypeScript292
polm/cutlet
Japanese to romaji converter in Python
Language:Python29020
taishi-i/awesome-japanese-nlp-resources
A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese
66926
keonlee9420/Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Language:Python14319
heatz123/naturalspeech
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
Language:Python46567
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Language:Python1.3k99
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python6.7k1.2k
aparrish/pronouncingjs
a simple javascript interface to the CMU pronouncing dictionary (for node and browser!)
Language:JavaScript705
tauri-apps/tauri
Build smaller, faster, and more secure desktop applications with a web frontend.
Language:Rust82.2k2.5k
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python20.7k2.1k
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
11.9k765
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Language:Python54986
LuChengTHU/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
Language:Python1.5k118
YatingMusic/ddsp-singing-vocoders
Official implementation of SawSing (ISMIR'22)
Language:Python25036
acids-ircam/creative_ml
Creative Machine Learning course and notebook tutorials in JAX, PyTorch and Numpy
Language:Jupyter Notebook21290
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python68k8k
superpoweredSDK/web-audio-javascript-webassembly-SDK-interactive-audio
🌐 Superpowered Web Audio JavaScript and WebAssembly SDK for modern web browsers. Allows developers to implement low-latency interactive audio features into web sites and web apps with a friendly Javascript API. https://superpowered.com
14916
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
Language:Python1.9k167
cainesap/syllabify
Automatically convert plain text into phonemes (US English pronunciation) and syllabify
Language:Python245
repp/big-phoney
Get phonetic spellings and syllable counts for any english word. Works with made-up and non-dictionary words
Language:Python8913
bentoml/BentoML
The easiest way to serve AI apps and models - Build reliable Inference APIs, LLM apps, Multi-model chains, RAG service, and much more!
Language:Python7k779
teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
Language:Jupyter Notebook70370
keonlee9420/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Language:Python31044
csteinmetz1/auraloss
Collection of audio-focused loss functions in PyTorch
Language:Python71966
chomeyama/SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
Language:Python23334
chq1155/A-Survey-on-Generative-Diffusion-Model
89958
revsic/torch-nansypp
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
Language:Python13911
rishikksh20/HiFiplusplus-pytorch
HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
Language:Python15119
SoonbeomChoi/BEGANSing
Korean Singing Voice Synthesis based on Auto-regressive Boundary Equilibrium GAN
Language:Python6816
lawrencecchen/solid-konva
Language:TypeScript63
