Pinned Repositories
a_dcf
a-DCF: an architecture agnostic metric
arfit-python
This is a pure-python intepretation of ARfit toolkit, based on numpy (so far)
ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Interview-Notebook
:books: 技术面试需要掌握的基础知识整理
kaldi
This is now the official location of the Kaldi project.
LFT
This is the class information for my ISCAS 2021 paper. Needs more work on training scripts.
MSFT_CLAP
Learning audio concepts from natural language supervision
torch-dct
DCT (discrete cosine transform) functions for pytorch
wespeaker
Research and Production Oriented Speaker Recognition Toolkit
underdogliu's Repositories
underdogliu/MSFT_CLAP
Learning audio concepts from natural language supervision
underdogliu/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
underdogliu/wespeaker
Research and Production Oriented Speaker Recognition Toolkit
underdogliu/a_dcf
a-DCF: an architecture agnostic metric
underdogliu/aasist-bonafide
WIP. Repo for the paper "Speaker-aware Anti-Spoofing".
underdogliu/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
underdogliu/asteroid
The PyTorch-based audio source separation toolkit for researchers
underdogliu/asvspoof-2021-baselines
ASVspoof 2021 Baseline Systems
underdogliu/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
underdogliu/dihard3_baseline
underdogliu/kaldi
This is now the official location of the Kaldi project.
underdogliu/ASVSpoof5-SASVBaseline
underdogliu/coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
underdogliu/dm-haiku
JAX-based neural network library
underdogliu/espnet
End-to-End Speech Processing Toolkit
underdogliu/jtubespeech
underdogliu/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
underdogliu/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
underdogliu/nnsvs
Neural network-based singing voice synthesis library for research
underdogliu/open_clip
An open source implementation of CLIP.
underdogliu/project-NN-Pytorch-scripts
underdogliu/promptbench
A unified evaluation framework for large language models
underdogliu/s3prl
Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)
underdogliu/SASVC2022_Baseline
Baseline for the Spoofing-aware Speaker Verification Challenge 2022
underdogliu/speechbrain
A PyTorch-based Speech Toolkit
underdogliu/underdogliu
underdogliu/underdogliu.github.io
✨ Build a beautiful and simple website in literally minutes. Demo at https://beautifuljekyll.com
underdogliu/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
underdogliu/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
underdogliu/VBx
Variational Bayes HMM over x-vectors diarization