hiyoung-asr

Pinned Repositories

3-min-pytorch
[WIP][준비중] "3분 딥러닝 파이토치맛" 예제 코드
Language:Jupyter Notebook0 0 00
AAS_enhancement
This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervision".
Language:Python0 0 00
aasist
Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"
Language:Python0 0 00
DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
Language:C++0 0 00
espnet
End-to-End Speech Processing Toolkit
Language:Shell0 0 00
kaldi
This is now the official location of the Kaldi project.
Language:Shell0 0 00
pwc
Papers with code. Sorted by stars. Updated weekly.
1 0 00
pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Language:Python0 0 00
returnn-experiments
experiments with RETURNN
Language:Python0 0 00
wav2letter
Facebook AI Research Automatic Speech Recognition Toolkit
Language:Lua0 0 00

hiyoung-asr's Repositories

hiyoung-asr/CleanUNet
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
Language:Python0 0
hiyoung-asr/ctm
Language:Python0 0
hiyoung-asr/DBPNet
DBPNet model
hiyoung-asr/DeepFilterNet
Noise supression using deep filtering
Language:Python0 0
hiyoung-asr/DiariST
Language:Python0 0
hiyoung-asr/IMTL
Multi-task learning for end-to-end speech translation.
Language:Python0 0
hiyoung-asr/jsalt2020_simulate
Training data simulation
Language:Python0 0
hiyoung-asr/mamba
Language:Python0 0
hiyoung-asr/Mamba-TasNet
Language:Jupyter Notebook0 0
hiyoung-asr/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Language:Jupyter Notebook0 0
hiyoung-asr/MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
0 0
hiyoung-asr/MossFormer2
This is the audio sample repository for speech separation model "MossFormer2".
Language:Python0 0
hiyoung-asr/MP-SENet
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
Language:Python0 0
hiyoung-asr/OpenVoice
Instant voice cloning by MyShell.
Language:Python0 0
hiyoung-asr/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook0 0
hiyoung-asr/reverberation-as-supervision
Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation
Language:Python0 0
hiyoung-asr/S4M
Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models
Language:Python0 0
hiyoung-asr/SEMamba
This is the official implementation of the SEMamba paper.
0 0
hiyoung-asr/sgmse-bbed
TODO
Language:Python0 0
hiyoung-asr/sgmse_crp
Language:Python0 0
hiyoung-asr/SIG-Challenge
hiyoung-asr/SPMamba
Language:Python0 0
hiyoung-asr/SSpaVAlDo
Language:Jupyter Notebook0 0
hiyoung-asr/storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
Language:Python0 0
hiyoung-asr/tf-locoformer
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
Language:Python0 0
hiyoung-asr/torchsde
Differentiable SDE solvers with GPU support and efficient sensitivity analysis.
Language:Python0 0
hiyoung-asr/tssep
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
Language:Python0 0
hiyoung-asr/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python0 0
hiyoung-asr/VPIDM
This is official repository of new SOTA diffusion models based method for speech enhancement
Language:Python0 0
hiyoung-asr/whisper-medusa
Whisper with Medusa heads
Language:Python0 0