hwRG

Speech AI Engineer

@AITRICSSEOUL, REPUBLIC OF KOREA

hwRG's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python70.9k 576 08.4k
meta-llama/llama
Inference code for Llama models
Language:Python56.3k 527 9859.6k
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Python39.1k 445 3135k
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook36k 331 4414.2k
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language:C++35.5k 312 1.4k3.6k
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook25.8k 323 4023.3k
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python10.3k 78 485979
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Language:Python8.3k 144 3.8k1.5k
lixin4ever/Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
Language:Jupyter Notebook4.2k 128 28298
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
Language:Python3k 87 98419
davabase/whisper_real_time
Real time transcription with OpenAI Whisper.
Language:Python2.4k 29 49398
wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
Language:C2.1k 50 82409
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Language:Python2k 49 126319
tsurumeso/vocal-remover
Vocal Remover using Deep Neural Networks
Language:Python1.6k 39 134227
Edresson/YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Language:Jupyter Notebook898 24 5079
NVIDIA/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
Language:Python879 70 0101
maum-ai/nuwave
NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021
Language:Python282 11 1520
andersonba/yve-bot
Smart rule-based bot. For Browser & Node.
Language:TypeScript224 7 9957
junhsss/consistency-models
A Toolkit for OpenAI's Consistency Models.
Language:Python197 9 1112
hayeong0/DDDM-VC
Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)
Language:Python194 16 1920
keonlee9420/StyleSpeech
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Language:Python190 6 1723
keonlee9420/Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Language:Python187 7 1727
dhchoi99/NANSY
Language:Python161 14 1920
keonlee9420/STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021
Language:Python158 6 631
ncsoft/avocodo
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
Language:Python149 4 519
haoheliu/ssr_eval
Evaluation and Benchmarking of Speech Super-resolution Methods
Language:Python140 4 1112
neonbjb/tts-scores
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
Language:Python140 5 1415
SMART-TTS/SMART-G2P
Language:Python92 3 339
richardbaihe/a3t
Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing
Language:Python85 4 810
scarletcho/KoLM
Korean text normalization and language preparation package for LM in Kaldi-based ASR system
Language:Python59 5 218

hwRG

hwRG's Stars

openai/whisper

meta-llama/llama

Stability-AI/stablediffusion

suno-ai/bark

ggerganov/whisper.cpp

openai/CLIP

mlfoundations/open_clip

triton-inference-server/server

lixin4ever/Conference-Acceptance-Rate

enhuiz/vall-e

davabase/whisper_real_time

wiseman/py-webrtcvad

lifeiteng/vall-e

tsurumeso/vocal-remover

Edresson/YourTTS

NVIDIA/BigVGAN

maum-ai/nuwave

andersonba/yve-bot

junhsss/consistency-models

hayeong0/DDDM-VC

keonlee9420/StyleSpeech

keonlee9420/Cross-Speaker-Emotion-Transfer

dhchoi99/NANSY

keonlee9420/STYLER

ncsoft/avocodo

haoheliu/ssr_eval

neonbjb/tts-scores

SMART-TTS/SMART-G2P

richardbaihe/a3t

scarletcho/KoLM