underdogliu

Tokyo, Japan

Pinned Repositories

a_dcf
a-DCF: an architecture agnostic metric
Language:Python0 0 00
arfit-python
This is a pure-python intepretation of ARfit toolkit, based on numpy (so far)
Language:Python20
ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Language:Python1 0 00
Interview-Notebook
:books: 技术面试需要掌握的基础知识整理
10
kaldi
This is now the official location of the Kaldi project.
Language:Shell0 0 01
LFT
This is the class information for my ISCAS 2021 paper. Needs more work on training scripts.
Language:Python1 1 10
MSFT_CLAP
Learning audio concepts from natural language supervision
Language:Python20
torch-dct
DCT (discrete cosine transform) functions for pytorch
Language:Python4 0 00
wespeaker
Research and Production Oriented Speaker Recognition Toolkit
Language:Python1 0 00

underdogliu's Repositories

underdogliu/MSFT_CLAP
Learning audio concepts from natural language supervision
Language:Python20
underdogliu/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Language:Python1 0 00
underdogliu/wespeaker
Research and Production Oriented Speaker Recognition Toolkit
Language:Python1 0 00
underdogliu/a_dcf
a-DCF: an architecture agnostic metric
Language:Python0 0 00
underdogliu/aasist-bonafide
WIP. Repo for the paper "Speaker-aware Anti-Spoofing".
00
underdogliu/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python0 0 00
underdogliu/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python00
underdogliu/asvspoof-2021-baselines
ASVspoof 2021 Baseline Systems
Language:Python00
underdogliu/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
Language:Python0 0 00
underdogliu/dihard3_baseline
Language:Perl0 0 00
underdogliu/kaldi
This is now the official location of the Kaldi project.
Language:Shell0 0 01
underdogliu/ASVSpoof5-SASVBaseline
Language:Python
underdogliu/coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python0 0
underdogliu/dm-haiku
JAX-based neural network library
underdogliu/espnet
End-to-End Speech Processing Toolkit
Language:Python0 0
underdogliu/jtubespeech
underdogliu/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook
underdogliu/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
underdogliu/nnsvs
Neural network-based singing voice synthesis library for research
Language:Python
underdogliu/open_clip
An open source implementation of CLIP.
Language:Jupyter Notebook0 0
underdogliu/project-NN-Pytorch-scripts
Language:Python0 0
underdogliu/promptbench
A unified evaluation framework for large language models
underdogliu/s3prl
Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)
Language:Python0 0
underdogliu/SASVC2022_Baseline
Baseline for the Spoofing-aware Speaker Verification Challenge 2022
Language:Python
underdogliu/speechbrain
A PyTorch-based Speech Toolkit
Language:Python0 01
underdogliu/underdogliu
underdogliu/underdogliu.github.io
✨ Build a beautiful and simple website in literally minutes. Demo at https://beautifuljekyll.com
Language:HTML0 0
underdogliu/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
underdogliu/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
underdogliu/VBx
Variational Bayes HMM over x-vectors diarization
Language:Python