speaker-diariazation

There are 3 repositories under speaker-diariazation topic.

NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python12.2k 207 2.3k2.5k
tim-roethig-db/amondin
A simple and private transcription tool able to segment speakers and convert audio to text.
Language:Python1 1 00
ZhaZhaFon/repo_spectralclustering
说话人分割仓库-聚类分割-谱聚类 || a ready-to-use repo for Speaker Diariazation with Spectral Clustering
Language:Jupyter Notebook1 0 00