gongouveia

MSc. Engineering Physics. AI/ML Software Engineer, Interested in ML Optimization, Quantization, and making efficient AI

Universidade de Coimbra Portugal, Lisbon, Porto

gongouveia's Stars

Jose-Sabater/whisper-pyannote
Whisper from OpenAi and diarization with Pyannote
Language:Python282
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python68.2k8.1k
hcmlab/vadnet
Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks
Language:Python42377
Vaibhavs10/fast-whisper-finetuning
Language:Jupyter Notebook43636
davabase/whisper_real_time
Real time transcription with OpenAI Whisper.
Language:Python2.3k385
doveg/whisper-real-time
A real time offline transcriber with gui, based on OpenAI whisper
Language:Python112
microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Language:C++14.2k2.9k
fleek/VADtransciber
Language:Python329
awexandrr/audioWhisper
Listen to any audio stream on your machine and print out the transcribed or translated audio.
Language:Python11212
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python4.1k402
javedali99/audio-to-text-transcription
This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically.
Language:Python11718
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Language:Python32.5k2.4k
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python11.7k1.2k
EncoraDigital/SAB-cnn-audio-denoiser
Tensorflow 2.0 implementation of the paper: A Fully Convolutional Neural Network for Speech Enhancement
Language:Jupyter Notebook25478
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language:Python3.5k281
hujinsen/pytorch-StarGAN-VC
Fully reproduce the paper of StarGAN-VC. Stable training and Better audio quality .
Language:Python24557
seth814/open-unmix-pytorch
Open-Unmix - Music Source Separation for PyTorch
66
nateshmbhat/pyttsx3
Offline Text To Speech synthesis for python
Language:Python2.1k329
roboflow/notebooks
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
Language:Jupyter Notebook5.3k811
facebookresearch/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Language:Python1.6k301
tomas-fryza/vhdl-course
VHDL course at Brno University of Technology
Language:Tcl93230
dahmadjid/spi_master_fpga
SPI Master RTL on fpga. ESP32 was the slave. highly reliable, tested upto 10MHz and 512 bits for transaction length
Language:VHDL4
jakubcabal/spi-fpga
SPI master and SPI slave for FPGA written in VHDL
Language:VHDL16338
duartegalvao/ArduZynq-Tutorials
Simple tutorials for getting started with programming on Trenz ArduZynq boards.
61
jtfell/c-fft
C-Implementations of FFT Algorithms.
Language:C3617
mahmoudparsian/pyspark-tutorial
PySpark-Tutorial provides basic algorithms using PySpark
Language:Jupyter Notebook1.2k469
ricardo-jasinski/vhdl-csv-file-reader
VHDL package for reading formatted data from comma-separated-values (CSV) files
Language:VHDL235
marmelo/tech-companies-in-portugal
:portugal: List of technology companies in Portugal.
1.3k203
milekium/spidev-lib
simple spidev c/c++ wrapper library
Language:C++4232
eiWare/LibPiSPI
C++ SPI Library for Raspberry Pi
Language:C++62

gongouveia

gongouveia's Stars

Jose-Sabater/whisper-pyannote

openai/whisper

hcmlab/vadnet

Vaibhavs10/fast-whisper-finetuning

davabase/whisper_real_time

doveg/whisper-real-time

microsoft/onnxruntime

fleek/VADtransciber

awexandrr/audioWhisper

snakers4/silero-vad

javedali99/audio-to-text-transcription

gradio-app/gradio

m-bain/whisperX

EncoraDigital/SAB-cnn-audio-denoiser

huggingface/distil-whisper

hujinsen/pytorch-StarGAN-VC

seth814/open-unmix-pytorch

nateshmbhat/pyttsx3

roboflow/notebooks

facebookresearch/denoiser

tomas-fryza/vhdl-course

dahmadjid/spi_master_fpga

jakubcabal/spi-fpga

duartegalvao/ArduZynq-Tutorials

jtfell/c-fft

mahmoudparsian/pyspark-tutorial

ricardo-jasinski/vhdl-csv-file-reader

marmelo/tech-companies-in-portugal

milekium/spidev-lib

eiWare/LibPiSPI