wangyang199609

get busy living or get busy dying

Pinned Repositories

asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
Language: Python
athena-signal
Language: C
audio-visual-speech-enhancement
Official Implementation of "Visual Speech Enhancement", Interspeech 2018.
Language: Python
av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
Language: Python
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
awesome-speech-recognition-speech-synthesis-papers
Speech synthesis, voice conversion, self-supervised learning, music generation, Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling
bsseval
audio source separation evaluation metrics
Language: Python
CodingInterviewChinese2
Source code for 《剑指Offer》 (Coding Interviews), second edition
Language: C++
ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
Language: Python

wangyang199609's Repositories

wangyang199609/av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
wangyang199609/asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
Language: Python
wangyang199609/athena-signal
Language: C
wangyang199609/avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
Language: Python
wangyang199609/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
wangyang199609/ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
Language: Python
wangyang199609/dnn_aec_data_process
Pre-processing script for TIMIT data for DNN-AEC work
Language: Python
wangyang199609/Dual-Path-Transformer-Network-PyTorch
Unofficial implementation of Dual-Path Transformer Network (DPTNet) for speech separation (Interspeech 2020)
Language: Python
wangyang199609/facenet-pytorch
Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models
Language: Python
wangyang199609/fucking-algorithm
Practicing algorithm problems is all about patterns; labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.
wangyang199609/learngit
Language: Python
wangyang199609/Lipreading_using_Temporal_Convolutional_Networks
ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
Language: Python
wangyang199609/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
wangyang199609/MuSE
Language: Python
wangyang199609/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Language: C++
wangyang199609/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language: Python
wangyang199609/RIR-Generator
Generating room impulse responses
Language: C++
wangyang199609/rir-generator-1
Language: Python
wangyang199609/rnnoise
Recurrent neural network for audio noise reduction
Language: C
wangyang199609/speaker_extraction_SpEx
multi-scale time domain speaker extraction
Language: Python
wangyang199609/speech-demo.github.io
wangyang199609/SpeechAlgorithms
Speech Algorithms, from the Speech Algorithm Group
Language: C
wangyang199609/speechbrain
A PyTorch-based Speech Toolkit
wangyang199609/traditional-speech-enhancement
Traditional speech enhancement methods
Language: MATLAB
wangyang199609/Tutorial_Separation
This repo summarizes the tutorials, datasets, papers, codes and tools for the speech separation and speaker extraction tasks. Pull requests are welcome.
Language: MATLAB
wangyang199609/v2rayNvpn
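Firewall circumvention, free firewall circumvention, free access to the open internet, free nodes, free proxies, free ss/ssr/v2ray/trojan nodes, Lantern, Google Play Store, circumvention proxies, overseas online games, foreign games, vpn, vpn recommendations, updated daily, accessing overseas sites, overseas internet, V2rayN, Qv2ray, V2rayW, V2RayS, Mellow, V2rayX, V2rayU, ClashX, Kitsunebi, BifrostV, i2Ray, Quantumult, Surge 4, winXray, Qv2ray, Kitsunebi, Trojan-Qt5, proxy servers, proxy providers ("airports"), Mario, World of Warcraft, poshMark, Amazon, Shopee, Mercari, Mercari, foreign trade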
wangyang199609/VoViT
VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer
Language: Python
wangyang199609/WebRTC_NS
Noise Suppression Module Port From WebRTC
Language: C
wangyang199609/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
Language: Python
wangyang199609/yt-dlp
A youtube-dl fork with additional features and fixes
Language: Python