Pinned Repositories
3-min-pytorch
[WIP][준비중] "3분 딥러닝 파이토치맛" 예제 코드
AAS_enhancement
This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervision".
aasist
Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"
DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
espnet
End-to-End Speech Processing Toolkit
kaldi
This is now the official location of the Kaldi project.
pwc
Papers with code. Sorted by stars. Updated weekly.
pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
returnn-experiments
experiments with RETURNN
wav2letter
Facebook AI Research Automatic Speech Recognition Toolkit
hiyoung-asr's Repositories
hiyoung-asr/CleanUNet
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
hiyoung-asr/ctm
hiyoung-asr/DBPNet
DBPNet model
hiyoung-asr/DeepFilterNet
Noise supression using deep filtering
hiyoung-asr/DiariST
hiyoung-asr/IMTL
Multi-task learning for end-to-end speech translation.
hiyoung-asr/jsalt2020_simulate
Training data simulation
hiyoung-asr/mamba
hiyoung-asr/Mamba-TasNet
hiyoung-asr/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
hiyoung-asr/MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
hiyoung-asr/MossFormer2
This is the audio sample repository for speech separation model "MossFormer2".
hiyoung-asr/MP-SENet
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
hiyoung-asr/OpenVoice
Instant voice cloning by MyShell.
hiyoung-asr/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
hiyoung-asr/reverberation-as-supervision
Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation
hiyoung-asr/S4M
Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models
hiyoung-asr/SEMamba
This is the official implementation of the SEMamba paper.
hiyoung-asr/sgmse-bbed
TODO
hiyoung-asr/sgmse_crp
hiyoung-asr/SIG-Challenge
hiyoung-asr/SPMamba
hiyoung-asr/SSpaVAlDo
hiyoung-asr/storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
hiyoung-asr/tf-locoformer
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
hiyoung-asr/torchsde
Differentiable SDE solvers with GPU support and efficient sensitivity analysis.
hiyoung-asr/tssep
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
hiyoung-asr/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
hiyoung-asr/VPIDM
This is official repository of new SOTA diffusion models based method for speech enhancement
hiyoung-asr/whisper-medusa
Whisper with Medusa heads