nvidia-nemo

There are 16 repositories under nvidia-nemo topic.

  • Rumeysakeskin/Turkish-Text-to-Speech

    Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan

    Language:Python58656
  • cr4yfish/nouv

    Free AI & Community powered Learning Experience

    Language:TypeScript401130
  • GoogleCloudPlatform/nvidia-nemo-on-gke

    Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine

    Language:HCL122116
  • bunyaminergen/WavLMMSDD

    This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.

    Language:Jupyter Notebook8403
  • Rumeysakeskin/Question-Answering-BERT

    Extractive Question-Answering with BERT on SQuAD v2.0 (Stanford Question Answering Dataset) using NVIDIA PyTorch Lightning

    Language:Jupyter Notebook8100
  • Rumeysakeskin/ASR-Quantization

    Post-training quantization on Nvidia Nemo ASR model

    Language:Jupyter Notebook710
  • KevinGeLe/SmartSRT

    📄 SmartSRT is a command-line tool for generating accurate subtitles with per-word timestamps. It uses WhisperAI for speech transcription, NVIDIA NeMo for diarization, and OpenCV for face recognition. The program is good at creating high accuracy subtitles. 🎧💻⚙️

  • denizariyan/Real-Time-Auto-Transcriber

    Automatic transcriber made with the Nvidia NeMo AI toolkit. Used to transcribe speech to text in real-time from any source. Requires CUDA capable GPU to run on the local machine, if setup using virtual audio cables can transcribe the audio that is being played in real-time without any other requirements.

    Language:Python4101
  • JINHXu/tutorial-speaker-identification-with-nemo

    The simplest & most comprehensible tutorial on speaker identification with NVIDIA's `Nemo`.

    Language:Python3105
  • HROlive/Poland-End-To-End-LLM-Bootcamp

    This bootcamp is designed to give NLP researchers an end-to-end overview on the fundamentals of NVIDIA NeMo framework, complete solution for building large language models. It will also have hands-on exercises complimented by tutorials, code snippets, and presentations to help researchers kick-start with NeMo LLM Service and Guardrails.

    Language:Jupyter Notebook2101
  • j3soon/LLM-Tutorial

    LLM tutorial materials include but not limited to NVIDIA NeMo, TensorRT-LLM, Triton Inference Server, and NeMo Guardrails.

    Language:Jupyter Notebook2111
  • aaaastark/NeMo-WeightsBiases-TTS

    Training and Tunning a Text to speech model with Nvidia NeMo and Weights and Biases

    Language:Jupyter Notebook120
  • ssharkov03/ru-speech-recognition

    Module for russian speech recognition using NVIDIA Nemo.

    Language:Python1100
  • transiteration/stt_kz_quartznet15x5

    Implementation of a Kazakh Speech-to-Text Model using the NVIDIA NeMo toolkit for efficient transcription of spoken Kazakh speech into text.

    Language:Python1100
  • GameOfPods/PAT

    PodcastProject Analytics Toolkit - Project that creates analytics various input data. Exported data is intended to be used in a PodcastProject website

    Language:Python0000
  • InfiniteHelios/nemo-audio-profanity-detector-app

    Audio profanity detector desktop app developed with PyQt5 using NVidia-Nemo tech.

    Language:Python0100