Pinned Repositories
acoular
Library for acoustic beamforming
AEC
Acoustic Echo Cancellation with LMS/RLS (基于LMS/RLS的自适应回声抵消)
Alibaba-MIT-Speech
Alibaba speech technology
asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
attention-transfer
Improving Convolutional Networks via Attention Transfer (ICLR 2017)
attention_is_all_you_need
[WIP] Attention Is All You Need (Vaswani et al. 2017) by Chainer.
AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
audioset
Fetch and use Google's AudioSet dataset
horovod
Distributed training framework for TensorFlow.
Two-dimensional-Self-attention-based-Speech-Enhancement
A 2-dimensional Self-attention-based Solution with Cooperative Gated Convolutional Modules for Speech Enhancement
chenxinglili's Repositories
chenxinglili/Two-dimensional-Self-attention-based-Speech-Enhancement
A 2-dimensional Self-attention-based Solution with Cooperative Gated Convolutional Modules for Speech Enhancement
chenxinglili/asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
chenxinglili/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
chenxinglili/av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
chenxinglili/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
chenxinglili/bark
🔊 Text-Prompted Generative Audio Model
chenxinglili/DARCN
The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"
chenxinglili/DCUNetTorchSound
Implementation of Phase-aware speech enhancement with deep complex U-Net
chenxinglili/DeepComplexCRN
chenxinglili/ganhacks
starter from "How to Train a GAN?" at NIPS2016
chenxinglili/GC3
chenxinglili/KAIR
Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN
chenxinglili/Listening-to-Sound-of-Silence-for-Speech-Denoising
[NeurIPS 2020] Official repository for the project "Listening to Sound of Silence for Speech Denoising"
chenxinglili/MSNet
Multi-scale speech enhancement
chenxinglili/performer-pytorch
An implementation of Performer, a linear attention-based transformer, in Pytorch
chenxinglili/pika
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
chenxinglili/python-pesq
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
chenxinglili/pytorch-optimizer
torch-optimizer -- collection of optimizers for Pytorch
chenxinglili/pytorch_cpp
Deep Learning sample programs using PyTorch in C++
chenxinglili/recommended-books
计算机经典书籍推荐 部分书籍提供PDF下载
chenxinglili/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
chenxinglili/SDNet
Speaker and Direction Inferred Dual-channel Speech Separation
chenxinglili/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
chenxinglili/singing_transcription_ICASSP2021
The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"
chenxinglili/sms_wsj
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
chenxinglili/SpeechTransProgress
Tracking the progress in end-to-end speech translation
chenxinglili/spleeter
Deezer source separation library including pretrained models.
chenxinglili/Subband-Music-Separation
Pytorch: Channel-wise subband input for better voice and accompaniment separation
chenxinglili/traditional-speech-enhancement
语音增强传统方法
chenxinglili/WeTS
A benchmark for the task of translation suggestion