weishanyi's Stars
conda-forge/miniforge
A conda-forge distribution.
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
haoxiangsnr/spiking-fullsubnet
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
gabrielmittag/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
modelscope/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
y-ren16/TiCodec
exercise-book-yq/Supercodec
haoheliu/SemantiCodec-inference
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.
Xiaobin-Rong/gtcrn
The official implementation of GTCRN, an ultra-lite speech enhancement model.
microsoft/SIG-Challenge
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
OpenTalker/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
ThomasHaubner/e2e_dnn_ad_control_for_lin_aec
End-To-End Deep Learning-based Adaptation Control for Linear Acoustic Echo Cancellation
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
microsoft/nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Rikorose/DeepFilterNet
Noise supression using deep filtering
yuguochencuc/BAE-Net
BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION
pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
xiph/LPCNet
Efficient neural speech synthesis
YUCHEN005/NASE
Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"
Crystalsound/FRN
lhwcv/self_attention_alignment
Deep model with built-in self-attention alignment for acoustic echo cancellation, Pytorch implement
fjiang9/NKF-AEC
Acoustic Echo Cancellation with Nerual Kalman Filtering
AndreevP/wvmos
MOS score prediction by fine-tuned wav2vec2.0 model
crlandsc/Music-Demixing-with-Band-Split-RNN
An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track)
Okrio/CRUSE
a lightweight network for monaural speech enhancement
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
FlagAI-Open/FlagAI
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.
alibabasglab/FRCRN