BingYang-20

Multi-channel Audio Signal Processing, Sound Source Localization, Self-Supervised Learning

Westlake University

BingYang-20's Stars

facebookresearch/detr
End-to-End Object Detection with Transformers
Language:Python13.7k 148 5262.5k
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Jupyter Notebook12k 96 3471.5k
google-research/vision_transformer
Language:Jupyter Notebook10.6k 105 2071.3k
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Language:Python8.5k 36 2981k
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Language:Python1.5k 41 239432
SamLynnEvans/Transformer
Transformer seq2seq model, program that can build a language translator from parallel corpus
Language:Python1.4k 19 34350
NVlabs/DG-Net
:couple: Joint Discriminative and Generative Learning for Person Re-identification. CVPR'19 (Oral) :couple:
Language:Python1.3k 30 78228
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Language:Jupyter Notebook1.2k 17 137221
microsoft/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
Language:Python1.1k 49 150414
DavidDiazGuerra/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
Language:Cuda493 10 5596
RoyJames/room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
Language:Shell416 7 034
Audio-WestlakeU/NBSS
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
Language:Python234 7 3626
detly/gammatone
Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings.
Language:Matlab220 20 1168
pytorch/torcheval
A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to facilitate metric computation in distributed training and tools for PyTorch model evaluations.
Language:Python218 15 3348
nttcslab/byol-a
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
Language:Python206 10 1635
tencent-ailab/FRA-RIR
Language:Python176 8 827
Audio-WestlakeU/FN-SSL
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization
Language:Python91 5 810
maj4e/pyrirtool
Measuring room impulse responses with python and sounddevice
Language:Python70 1 116
BingYang-20/SRP-DNN
A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
Language:Python41 4 114
anton-jeran/TS-RIR
Translating Synthetic RIRs to Real RIRs
Language:Python40 3 09
Chutlhu/dEchorate
Da - ECHO - RetrievAl - daTasEt
Language:Jupyter Notebook25 3 64
sharathadavanne/hungarian-net
Deep-learning-based implementation of the popular Hungarian algorithm that helps solve the assignment problem.
Language:Python24 2 23
BingYang-20/SAR-SSL
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer”
Language:Python16 2 21

BingYang-20

BingYang-20's Stars

facebookresearch/detr

CompVis/latent-diffusion

google-research/vision_transformer

lucidrains/denoising-diffusion-pytorch

LCAV/pyroomacoustics

SamLynnEvans/Transformer

NVlabs/DG-Net

YuanGongND/ast

microsoft/DNS-Challenge

DavidDiazGuerra/gpuRIR

RoyJames/room-impulse-responses

Audio-WestlakeU/NBSS

detly/gammatone

pytorch/torcheval

nttcslab/byol-a

tencent-ailab/FRA-RIR

Audio-WestlakeU/FN-SSL

maj4e/pyrirtool

BingYang-20/SRP-DNN

anton-jeran/TS-RIR

Chutlhu/dEchorate

sharathadavanne/hungarian-net

BingYang-20/SAR-SSL