speaker-diariazation

There are 3 repositories under speaker-diariazation topic.

  • NVIDIA/NeMo

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

    Language:Python12.2k2072.3k2.5k
  • tim-roethig-db/amondin

    A simple and private transcription tool able to segment speakers and convert audio to text.

    Language:Python1100
  • ZhaZhaFon/repo_spectralclustering

    说话人分割仓库-聚类分割-谱聚类 || a ready-to-use repo for Speaker Diariazation with Spectral Clustering

    Language:Jupyter Notebook1000