nvidia-nemo

There are 16 repositories under nvidia-nemo topic.

Rumeysakeskin/Turkish-Text-to-Speech
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
Language:Python58 6 56
cr4yfish/nouv
Free AI & Community powered Learning Experience
Language:TypeScript40 1 130
GoogleCloudPlatform/nvidia-nemo-on-gke
Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine
Language:HCL12 21 16
bunyaminergen/WavLMMSDD
This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.
Language:Jupyter Notebook8 4 03
Rumeysakeskin/Question-Answering-BERT
Extractive Question-Answering with BERT on SQuAD v2.0 (Stanford Question Answering Dataset) using NVIDIA PyTorch Lightning
Language:Jupyter Notebook8 1 00
Rumeysakeskin/ASR-Quantization
Post-training quantization on Nvidia Nemo ASR model
Language:Jupyter Notebook7 1 0
KevinGeLe/SmartSRT
📄 SmartSRT is a command-line tool for generating accurate subtitles with per-word timestamps. It uses WhisperAI for speech transcription, NVIDIA NeMo for diarization, and OpenCV for face recognition. The program is good at creating high accuracy subtitles. 🎧💻⚙️
6 5 11
denizariyan/Real-Time-Auto-Transcriber
Automatic transcriber made with the Nvidia NeMo AI toolkit. Used to transcribe speech to text in real-time from any source. Requires CUDA capable GPU to run on the local machine, if setup using virtual audio cables can transcribe the audio that is being played in real-time without any other requirements.
Language:Python4 1 01
JINHXu/tutorial-speaker-identification-with-nemo
The simplest & most comprehensible tutorial on speaker identification with NVIDIA's `Nemo`.
Language:Python3 1 05
HROlive/Poland-End-To-End-LLM-Bootcamp
This bootcamp is designed to give NLP researchers an end-to-end overview on the fundamentals of NVIDIA NeMo framework, complete solution for building large language models. It will also have hands-on exercises complimented by tutorials, code snippets, and presentations to help researchers kick-start with NeMo LLM Service and Guardrails.
Language:Jupyter Notebook2 1 01
j3soon/LLM-Tutorial
LLM tutorial materials include but not limited to NVIDIA NeMo, TensorRT-LLM, Triton Inference Server, and NeMo Guardrails.
Language:Jupyter Notebook2 1 11
aaaastark/NeMo-WeightsBiases-TTS
Training and Tunning a Text to speech model with Nvidia NeMo and Weights and Biases
Language:Jupyter Notebook1 2 0
ssharkov03/ru-speech-recognition
Module for russian speech recognition using NVIDIA Nemo.
Language:Python1 1 00
transiteration/stt_kz_quartznet15x5
Implementation of a Kazakh Speech-to-Text Model using the NVIDIA NeMo toolkit for efficient transcription of spoken Kazakh speech into text.
Language:Python1 1 00
GameOfPods/PAT
PodcastProject Analytics Toolkit - Project that creates analytics various input data. Exported data is intended to be used in a PodcastProject website
Language:Python0 0 00
InfiniteHelios/nemo-audio-profanity-detector-app
Audio profanity detector desktop app developed with PyQt5 using NVidia-Nemo tech.
Language:Python0 1 00

nvidia-nemo

Rumeysakeskin/Turkish-Text-to-Speech

cr4yfish/nouv

GoogleCloudPlatform/nvidia-nemo-on-gke

bunyaminergen/WavLMMSDD

Rumeysakeskin/Question-Answering-BERT

Rumeysakeskin/ASR-Quantization

KevinGeLe/SmartSRT

denizariyan/Real-Time-Auto-Transcriber

JINHXu/tutorial-speaker-identification-with-nemo

HROlive/Poland-End-To-End-LLM-Bootcamp

j3soon/LLM-Tutorial

aaaastark/NeMo-WeightsBiases-TTS

ssharkov03/ru-speech-recognition

transiteration/stt_kz_quartznet15x5

GameOfPods/PAT

InfiniteHelios/nemo-audio-profanity-detector-app