Pinned Repositories
asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
athena-signal
audio-visual-speech-enhancement
Official Implementation of "Visual Speech Enhancement", Interspeech 2018.
av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
awesome-speech-recognition-speech-synthesis-papers
Speech synthesis, voice conversion, self-supervised learning, music generation, Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling
bsseval
audio source separation evaluation metrics
CodingInterviewChinese2
Source code for the second edition of 《剑指Offer》 (Coding Interviews)
ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
wangyang199609's Repositories
wangyang199609/av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
wangyang199609/asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
wangyang199609/athena-signal
wangyang199609/avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
wangyang199609/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
wangyang199609/ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
wangyang199609/dnn_aec_data_process
Preprocessing script for TIMIT data, for DNN-AEC work
wangyang199609/Dual-Path-Transformer-Network-PyTorch
Unofficial implementation of Dual-Path Transformer Network (DPTNet) for speech separation (Interspeech 2020)
wangyang199609/facenet-pytorch
Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models
wangyang199609/fucking-algorithm
Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.
wangyang199609/learngit
wangyang199609/Lipreading_using_Temporal_Convolutional_Networks
ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
wangyang199609/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
wangyang199609/MuSE
wangyang199609/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
wangyang199609/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
wangyang199609/RIR-Generator
Generating room impulse responses
wangyang199609/rir-generator-1
wangyang199609/rnnoise
Recurrent neural network for audio noise reduction
wangyang199609/speaker_extraction_SpEx
multi-scale time domain speaker extraction
wangyang199609/speech-demo.github.io
wangyang199609/SpeechAlgorithms
Speech Algorithms, from 语音算法组 (Speech Algorithms Group)
wangyang199609/speechbrain
A PyTorch-based Speech Toolkit
wangyang199609/traditional-speech-enhancement
Traditional speech enhancement methods
wangyang199609/Tutorial_Separation
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
wangyang199609/v2rayNvpn
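Firewall circumvention, free firewall circumvention, free unrestricted internet access, free nodes, free proxies, free ss/ssr/v2ray/trojan nodes, Lantern, Google Play Store, circumvention proxies, overseas games, foreign games, VPN, VPN recommendations, updated daily, accessing the overseas internet, overseas internet, V2rayN, Qv2ray, V2rayW, V2RayS, Mellow, V2rayX, V2rayU, ClashX, Kitsunebi, BifrostV, i2Ray, Quantumult, Surge 4, winXray, Qv2ray, Kitsunebi, Trojan-Qt5, proxy servers, proxy providers ("airports"), Mario, World of Warcraft, poshMark, Amazon, Shopee, 煤炉 (Mercari), Mercari, foreign trade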
wangyang199609/VoViT
VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer
wangyang199609/WebRTC_NS
Noise Suppression Module Port From WebRTC
wangyang199609/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
wangyang199609/yt-dlp
A youtube-dl fork with additional features and fixes