BingYang-20
Multi-channel Audio Signal Processing, Sound Source Localization, Self-Supervised Learning
Westlake University
BingYang-20's Stars
facebookresearch/detr
End-to-End Object Detection with Transformers
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
google-research/vision_transformer
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
SamLynnEvans/Transformer
Transformer seq2seq model, program that can build a language translator from parallel corpus
NVlabs/DG-Net
:couple: Joint Discriminative and Generative Learning for Person Re-identification. CVPR'19 (Oral) :couple:
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
microsoft/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
DavidDiazGuerra/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
RoyJames/room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
Audio-WestlakeU/NBSS
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
detly/gammatone
Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings.
pytorch/torcheval
A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to facilitate metric computation in distributed training and tools for PyTorch model evaluations.
nttcslab/byol-a
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
tencent-ailab/FRA-RIR
Audio-WestlakeU/FN-SSL
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization
maj4e/pyrirtool
Measuring room impulse responses with python and sounddevice
BingYang-20/SRP-DNN
A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
anton-jeran/TS-RIR
Translating Synthetic RIRs to Real RIRs
Chutlhu/dEchorate
Da - ECHO - RetrievAl - daTasEt
sharathadavanne/hungarian-net
Deep-learning-based implementation of the popular Hungarian algorithm that helps solve the assignment problem.
BingYang-20/SAR-SSL
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer”