alibabasglab

Alibaba Group Speech Lab, Singapore

Alibaba Group, Singapore

Pinned Repositories

cLDM-DCL
30
ClearerVoice-Studio
ClearVoice
Language: Python 10
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language: Python 10
D2Former
This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement", which was submitted to ICASSP 2023.
Language: Python 36 1 46
fig_resources
2 1 00
FRCRN
137 3 1012
GatedFormer
This is the repository for the speech enhancement model SyncFormer
9 3 10
MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
89 3 78
MossFormer2
This is the audio sample repository for speech separation model "MossFormer2".
Language: Python 119 4 79
ClearerVoice-Studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Language: Python 2k 24 45144

alibabasglab's Repositories

alibabasglab/FRCRN
137 3 1012
alibabasglab/MossFormer2
This is the audio sample repository for speech separation model "MossFormer2".
Language: Python 119 4 79
alibabasglab/MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
89 3 78
alibabasglab/D2Former
This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement", which was submitted to ICASSP 2023.
Language: Python 36 1 46
alibabasglab/GatedFormer
This is the repository for the speech enhancement model SyncFormer
9 3 10
alibabasglab/cLDM-DCL
30
alibabasglab/fig_resources
2 1 00
alibabasglab/ClearerVoice-Studio
ClearVoice
Language: Python 10
alibabasglab/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language: Python 10
alibabasglab/FLASH-pytorch
Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
Language: Python 1 0 00
alibabasglab/MS-SNSD
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.
Language: HTML 1 0 00
alibabasglab/speechbrain
A PyTorch-based Speech Toolkit
Language: Python 1 0 00
alibabasglab/TAC
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
Language: Python 1 0 00
alibabasglab/tts
Bilingual and Code-Switching Speech Synthesis
Language: HTML 1 0 0
alibabasglab/vc
cross-lingual voice conversion
Language: HTML 1
