jyhan03

Audio & Speech Processing

Brno University of Technology

jyhan03's Stars

facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook 47.3k 306 6665.6k
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language: Python 39k 443 3125k
google-research/google-research
Google Research
Language: Jupyter Notebook 34.1k 749 1.3k7.9k
Lightning-AI/pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Language: Python 28.3k 250 7.1k3.4k
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Models
18.5k 372 241.5k
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Language: Python 7.9k 97 1.6k959
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
7.5k 331 264908
EvanLi/Github-Ranking
:star:Github Ranking:star: Github stars and forks ranking list. Github Top100 stars list of different languages. Automatically update daily. | GitHub repository ranking, automatically updated daily
Language: Python 6.9k 79 26451
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language: Jupyter Notebook 6.2k 71 991768
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
Language: Python 944 44 417216
jia-zhuang/pytorch-multi-gpu-training
A summary of the methods and principles of single-machine multi-GPU training in PyTorch
Language: Python 764 5 884
yangdongchao/UniAudio
The Open Source Code of UniAudio
Language: Python 518 37 3332
huggingface/diarizers
Language: Python 253 5 816
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
212 11 03
tango4j/Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
Language: Python 108 7 215
desh2608/gss
A simple package for Guided source separation (GSS)
Language: Python 107 5 813
pyf98/DPHuBERT
INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"
Language: Python 104 6 59
Audio-WestlakeU/FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
Language: Python 81 3 114
haoxiangsnr/spiking-fullsubnet
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
Language: Python 69 3 912
BUTSpeechFIT/EEND_dataprep
Language: Shell 49 5 87
jsalt2020-asrdiar/jsalt2020_simulate
Training data simulation
Language: Python 42 10 86
Audio-WestlakeU/pytorch_lightning_template_for_beginners
A pytorch template for beginners based on pytorch_lightning
Language: Python 36 3 05
fgnt/graph_pit
Language: Python 33 6 38
joonaskalda/PixIT
Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings" published at Odyssey 2024
Language: Python 32 6 31
IdoAmos/not-from-scratch
Language: Python 24 1 10
Mu-Y/DiariST
Language: Python 18 4 03
haoxiangsnr/audioinfo
A small tool to calculate the distribution of audio durations in a directory
Language: Python 13 1 01
qiuqiangkong/materials_for_students
13 1 1
UDASE-CHiME2023/baseline
Baselines for the UDASE task of the CHiME-7 challenge
Language: Python 12 2 00
wsstriving/DiarizersLM
Language: Python 10
