zqwang7's Stars
JusperLee/SonicSim
HaoFengyuan/X-TF-GridNet
The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", which is accepted by Information Fusion.
qiuqiangkong/mini_source_separation
merlresearch/tssep
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
hwdsl2/openvpn-install
OpenVPN server installer for Ubuntu, Debian, AlmaLinux, Rocky Linux, CentOS, Fedora, openSUSE, Amazon Linux 2 and Raspberry Pi OS
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
google/visqol
Perceptual Quality Estimator for speech and audio
maj4e/pyrirtool
Measuring room impulse responses with python and sounddevice
chimechallenge/CHiME6_falign
This repository contains forced alignment segmentation for the CHiME-6 dataset, in the context of the CHiME-7 DASR Challenge.
chimechallenge/chime6-synchronisation
Code for synchronising all CHiME-5 audio signals for use in CHiME-6
Wataru-Nakata/miipher
Unofficial implementation of miipher
apple/axlearn
An Extensible Deep Learning Library
fgnt/meeteval
MeetEval - A meeting transcription evaluation toolkit
asteroid-team/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
tencent-ailab/FRA-RIR
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
IntelLabs/IntelNeuromorphicDNSChallenge
Intel Neuromorphic DNS Challenge
cocktail-fork/cocktail-fork.github.io
lijuncheng16/AudioTaggingDoneRight
experiments about AudioSet
audiolabs/torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
FrancoisGrondin/BIRD
Big Impulse Response Dataset
JunweiLiang/awesome_lists
Awesome Lists for Tenure-Track Assistant Professors and PhD students. (助理教授/博士生生存指南)
zqwang7/CausalityCheck
Causality Check in Frame-online Speech Separation
fakufaku/torchiva
Blind source separation with independent vector analysis family of algorithm in torch
cogmhear/avse_challenge
COG-MHEAR Audio-Visual Speech Enhancement Challenge
YuanGongND/ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
nttcslab-sp/dnn_wpe
dhgrs/chainer-WaveGlow
A Chainer implementation of WaveGlow.