gongouveia
MSc. Engineering Physics. AI/ML Software Engineer, Interested in ML Optimization, Quantization, and making efficient AI
Universidade de Coimbra Portugal, Lisbon, Porto
gongouveia's Stars
Jose-Sabater/whisper-pyannote
Whisper from OpenAi and diarization with Pyannote
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
hcmlab/vadnet
Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks
Vaibhavs10/fast-whisper-finetuning
davabase/whisper_real_time
Real time transcription with OpenAI Whisper.
doveg/whisper-real-time
A real time offline transcriber with gui, based on OpenAI whisper
microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
fleek/VADtransciber
awexandrr/audioWhisper
Listen to any audio stream on your machine and print out the transcribed or translated audio.
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
javedali99/audio-to-text-transcription
This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically.
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
EncoraDigital/SAB-cnn-audio-denoiser
Tensorflow 2.0 implementation of the paper: A Fully Convolutional Neural Network for Speech Enhancement
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
hujinsen/pytorch-StarGAN-VC
Fully reproduce the paper of StarGAN-VC. Stable training and Better audio quality .
seth814/open-unmix-pytorch
Open-Unmix - Music Source Separation for PyTorch
nateshmbhat/pyttsx3
Offline Text To Speech synthesis for python
roboflow/notebooks
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
facebookresearch/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
tomas-fryza/vhdl-course
VHDL course at Brno University of Technology
dahmadjid/spi_master_fpga
SPI Master RTL on fpga. ESP32 was the slave. highly reliable, tested upto 10MHz and 512 bits for transaction length
jakubcabal/spi-fpga
SPI master and SPI slave for FPGA written in VHDL
duartegalvao/ArduZynq-Tutorials
Simple tutorials for getting started with programming on Trenz ArduZynq boards.
jtfell/c-fft
C-Implementations of FFT Algorithms.
mahmoudparsian/pyspark-tutorial
PySpark-Tutorial provides basic algorithms using PySpark
ricardo-jasinski/vhdl-csv-file-reader
VHDL package for reading formatted data from comma-separated-values (CSV) files
marmelo/tech-companies-in-portugal
:portugal: List of technology companies in Portugal.
milekium/spidev-lib
simple spidev c/c++ wrapper library
eiWare/LibPiSPI
C++ SPI Library for Raspberry Pi