Hongjiang-Yu

Speech processing

Concordia University

Hongjiang-Yu's Stars

fishaudio/fish-speech
Brand new TTS solution
Language:Python5.9k460
WenetSpeech4TTS/wenetspeech4tts
Language:HTML4
MatsuriDayo/NekoBoxForAndroid
NekoBox for Android / sing-box / universal proxy toolchain for Android
Language:Kotlin9.9k853
githubvpn007/ClashX
ClashX，ClashX教程，ClashX配置教程，ClashX for mac
31438
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language:Jupyter Notebook7.3k713
facebookresearch/ConvNeXt-V2
Code release for ConvNeXt V2 model
Language:Python1.4k112
NVIDIA/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
Language:Python77389
maum-ai/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
Language:Python25846
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python28k3k
TeaPoly/Conformer-Athena
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
Language:Python438
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:Python3.9k1k
sony/bigvsan
Pytorch implementation of BigVSAN
Language:Python19017
keithito/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Language:Python2.9k965
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python8.1k569
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python20.2k2k
csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Language:C++17634
k2-fsa/icefall
Language:Python843273
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Language:Python1k90
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Language:Python1.9k318
X-LANCE/UniCATS-CTX-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
Language:Python11015
Azure-Samples/cognitive-services-speech-sdk
Sample code for the Microsoft Cognitive Services Speech SDK
Language:C#2.7k1.8k
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
Language:Python908205
pseeth/argbind
Simple package for binding functions to CLI or config files.
Language:Python441
sh-lee-prml/HierSpeechpp
The official implementation of HierSpeech++
Language:Python1.1k135
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python130k25.7k
haoheliu/AudioLDM2
Text-to-Audio/Music Generation
Language:Python2.2k172
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Jupyter Notebook10.6k1k
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language:Python4.5k354
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python32.1k3.9k
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language:Python3.3k299

Hongjiang-Yu

Hongjiang-Yu's Stars

fishaudio/fish-speech

WenetSpeech4TTS/wenetspeech4tts

MatsuriDayo/NekoBoxForAndroid

githubvpn007/ClashX

jasonppy/VoiceCraft

facebookresearch/ConvNeXt-V2

NVIDIA/BigVGAN

maum-ai/univnet

2noise/ChatTTS

TeaPoly/Conformer-Athena

wenet-e2e/wenet

sony/bigvsan

keithito/tacotron

facebookresearch/xformers

facebookresearch/audiocraft

csukuangfj/kaldifeat

k2-fsa/icefall

descriptinc/descript-audio-codec

lifeiteng/vall-e

X-LANCE/UniCATS-CTX-vec2wav

Azure-Samples/cognitive-services-speech-sdk

lhotse-speech/lhotse

pseeth/argbind

sh-lee-prml/HierSpeechpp

huggingface/transformers

haoheliu/AudioLDM2

facebookresearch/seamless_communication

yl4579/StyleTTS2

coqui-ai/TTS

facebookresearch/encodec