why-arong's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
coqui-ai/TTS
๐ธ๐ฌ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
google/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
KindXiaoming/pykan
Kolmogorov Arnold Networks
jordanbaird/Ice
Powerful menu bar manager for macOS
shenweichen/DeepCTR
Easy-to-use,Modular and Extendible package of deep-learning based CTR models .
brave-people/Dev-Event
๐๐ ๊ฐ๋ฐ์ {์จ๋น๋, ์ปจํผ๋ฐ์ค, ํด์ปคํค} ํ์ฌ๋ฅผ ์๋ ค๋๋ฆฝ๋๋ค. [with ๋จ์ก๋ฆฌ ์ผ๋ฒ์ง]
MagicStack/asyncpg
A fast PostgreSQL Database Client Library for Python/asyncio.
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Codium-ai/pr-agent
๐CodiumAI PR-Agent: An AI-Powered ๐ค Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! ๐ป๐
Plachtaa/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
pagefaultgames/pokerogue
A browser based Pokรฉmon fangame heavily inspired by the roguelite genre.
zpoint/CPython-Internals
Dive into CPython internals, trying to illustrate every detail of CPython implementation
dl0312/open-apis-korea
๐ฐ๐ท ํ๊ตญ์ด ์ฌ์ฉ์๋ฅผ ์ํ ์๋น์ค์ ์ฌ์ฉํ๊ธฐ ์ํ ์คํ API ๋ชจ์
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
wannesm/dtaidistance
Time series distances: Dynamic Time Warping (fast DTW implementation in C)
hibuz/dev-conf-replay
๐ ์ต๊ทผ ๊ตญ๋ด IT ์ธ๋ฏธ๋ ๋ฐ ๊ฐ๋ฐ์๐ป ์ปจํผ๋ฐ์ค ์์์ ๋ค์ ๋ณด๊ธฐ๐ ๋งํฌ๋ฅผ ํ๊ณณ์ ์ ๋ฆฌํ์ต๋๋ค!
shivammehta25/Matcha-TTS
[ICASSP 2024] ๐ต Matcha-TTS: A fast TTS architecture with conditional flow matching
lablup/backend.ai
Backend.AI is a streamlined, container-based computing cluster platform that hosts popular computing/ML frameworks and diverse programming languages, with pluggable heterogeneous accelerator support including CUDA GPU, ROCm GPU, TPU, IPU and other NPUs.
rtzr/Awesome-Korean-Speech-Recognition
ํ๊ตญ์ด ์์ฑ์ธ์ STT API ๋ฆฌ์คํธ. ๊ฐ ์ฑ๋ฅ ๋ฒค์น๋งํฌ.
Kyubyong/g2pK
g2pK: g2p module for Korean
executablebooks/mystmd
Command line tools for working with MyST Markdown.
hccho2/Tacotron-Wavenet-Vocoder-Korean
Tacotron, Korean, Wavenet-Vocoder, Korean TTS
executablebooks/meta
A community dedicated to supporting tools for technical and scientific communication and interactive computing
Pseudo-Lab/CPython-Guide
CPython ํํค์น๊ธฐ ์คํฐ๋
kookmin-sw/capstone-2024-08
์๋์ด์ ์ค๋น์์ ์ํ ๋ง์ถคํ AI ์คํผ์น ์ฐ์ต ์ ํ๋ฆฌ์ผ์ด์ , Loro(๋ก๋ก)
timesoft-nia/NIA22_2-017
NIA22 2-017 ๋ด์ค ๋๋ณธ ๋ฐ ์ต์ปค ์์ฑ ๋ฐ์ดํฐ
Balajirvp/Dynamic-Time-Warping
Leveraged Dynamic Time Warping (DTW) to assess the similarity between specific audio tracks