BongkiLee's Stars
0nutation/SpeechGPT
SpeechGPT Series: Speech Large Language Models
bytedance/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
YuanGongND/ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
AudioLLMs/AudioBench
AudioBench: A Universal Benchmark for Audio Large Language Models
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
meta-llama/llama3
The official Meta Llama 3 GitHub site
nuniz/blind_rt60
Algorithm for blind estimation of reverberation time
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
likejazz/llama3.np
llama3.np is a pure NumPy implementation for Llama 3 model.
rtzr/Awesome-Korean-Speech-Recognition
한국어 음성인식 STT API 리스트. 각 성능 벤치마크.
alumae/sv_score_calibration
Score calibration for speaker verification
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
ductuantruong/enskd
Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
dynamic-superb/dynamic-superb
The official repository of Dynamic-SUPERB.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Mishuni/Pip_Package_Practice
pip package deployment sample
Stability-AI/StableLM
StableLM: Stability AI Language Models
zyzisyz/mfa_conformer
Janghyun1230/Speaker_Verification
Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
v-iashin/VoxCeleb
An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset
JaesungHuh/VoxSRC2022
VoxSRC2022 workshop development kit
nikvaessen/w2v2-speaker
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053
Exploration-Lab/COGMEN
Justin-A/DeepLearning101
Code about DeepLearning101 Book
yunjey/pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
Audio-WestlakeU/audiossl
A library built for easier audio self-supervised training, downstream tasks evaluation
Sanyuan-Chen/CSS_with_Conformer
Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.