weishanyi

weishanyi's Stars

conda-forge/miniforge
A conda-forge distribution.
Language: Shell · 6.6k stars · 336 forks
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language: Python · 7.7k stars · 957 forks
haoxiangsnr/spiking-fullsubnet
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
Language:Python7814
gabrielmittag/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
Language:Python699118
modelscope/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications such as text-to-speech synthesis and music generation.
Language:Python37131
y-ren16/TiCodec
Language:Python573
exercise-book-yq/Supercodec
Language:Python426
haoheliu/SemantiCodec-inference
Ultra-low-bitrate neural audio codec (0.31~1.40 kbps) with better semantics in the latent space.
Language:Python1569
Xiaobin-Rong/gtcrn
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Language:Python22038
microsoft/SIG-Challenge
Language:Python766
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language: Python · 30k stars · 3k forks
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA language modeling approach to audio generation from Google Research, in PyTorch
Language: Python · 2.5k stars · 266 forks
OpenTalker/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Language: Python · 12k stars · 2.2k forks
ThomasHaubner/e2e_dnn_ad_control_for_lin_aec
End-To-End Deep Learning-based Adaptation Control for Linear Acoustic Echo Cancellation
Language:Python2712
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language: Python · 6.5k stars · 635 forks
microsoft/nni
An open-source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyperparameter tuning.
Language: Python · 14.1k stars · 1.8k forks
Rikorose/DeepFilterNet
Noise suppression using deep filtering
Language: Python · 2.6k stars · 239 forks
yuguochencuc/BAE-Net
BAE-Net: A Low-Complexity and High-Fidelity Bandwidth-Adaptive Neural Network for Speech Super-Resolution
Language:Python583
pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
Language: C++ · 2.2k stars · 376 forks
xiph/LPCNet
Efficient neural speech synthesis
Language: C · 1.1k stars · 295 forks
YUCHEN005/NASE
Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"
Language:Python842
Crystalsound/FRN
Language:Python266
lhwcv/self_attention_alignment
Deep model with built-in self-attention alignment for acoustic echo cancellation, PyTorch implementation
Language:Python3612
fjiang9/NKF-AEC
Acoustic Echo Cancellation with Neural Kalman Filtering
Language:HTML24261
AndreevP/wvmos
MOS score prediction by fine-tuned wav2vec2.0 model
Language:Python14919
crlandsc/Music-Demixing-with-Band-Split-RNN
An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track)
Language:Python14413
Okrio/CRUSE
A lightweight network for monaural speech enhancement
Language:Python5010
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
Language: Python · 15.7k stars · 1.9k forks
FlagAI-Open/FlagAI
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use, and extensible toolkit for large-scale models.
Language: Python · 3.8k stars · 416 forks
alibabasglab/FRCRN
13512