jyhan03's Stars
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
google-research/google-research
Google Research
Lightning-AI/pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
EvanLi/Github-Ranking
:star:Github Ranking:star: Github stars and forks ranking list. Github Top100 stars list of different languages. Automatically update daily. | Github仓库排名,每日自动更新
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
jia-zhuang/pytorch-multi-gpu-training
整理 pytorch 单机多 GPU 训练方法与原理
yangdongchao/UniAudio
The Open Source Code of UniAudio
huggingface/diarizers
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
tango4j/Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
desh2608/gss
A simple package for Guided source separation (GSS)
pyf98/DPHuBERT
INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"
Audio-WestlakeU/FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
haoxiangsnr/spiking-fullsubnet
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
BUTSpeechFIT/EEND_dataprep
jsalt2020-asrdiar/jsalt2020_simulate
Training data simulation
Audio-WestlakeU/pytorch_lightning_template_for_beginners
A pytorch template for beginners based on pytorch_lightning
fgnt/graph_pit
joonaskalda/PixIT
Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings" published at Odyssey 2024
IdoAmos/not-from-scratch
Mu-Y/DiariST
haoxiangsnr/audioinfo
A small tool to calculate the distribution of audio durations in a directory
qiuqiangkong/materials_for_students
UDASE-CHiME2023/baseline
Baselines for the UDASE task of the CHiME-7 challenge
wsstriving/DiarizersLM