gabitza-tech's Stars
kssteven418/Squeezeformer
[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
miguelvalente/whisperer
Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.
abetlen/llama-cpp-python
Python bindings for llama.cpp
ggerganov/llama.cpp
LLM inference in C/C++
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
daniel03c1/NAS_VAD
nicklashansen/voice-activity-detection
Voice Activity Detection (VAD) using deep learning.
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
oscarknagg/raw-audio-gender-classification
Machine learning experiment to perform gender classification from raw audio.
zhihanyang2022/gender-audio-classification
A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.
jerpint/voicemd
jim-schwoebel/voice_gender_detection
♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
grep-rohan/GenderRecognitionByVoice
Using machine learning to recognise gender by analysing recorded voice.
x4nth055/gender-recognition-by-voice
Building a deep learning model that predicts the gender of a speaker using TensorFlow 2.
diovisgood/agender
Real-time estimation of gender and age
primaryobjects/voice-gender
Gender recognition by voice and speech analysis
ina-foss/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
SuperKogito/Voice-based-gender-recognition
Voice-based gender recognition using Mel-frequency cepstral coefficients (MFCC) and Gaussian mixture models (GMM)
Nitro-Language-Processing/Workshops-2023
eandersson/amqpstorm
Thread-safe Python RabbitMQ Client & Management library
desh2608/dover-lap
Python package for combining diarization system outputs.
SRA2/SPELL
Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)
ocontreras309/ML_Notebooks
A repository for public Machine Learning notebooks I have created
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Xflick/EEND_PyTorch
A PyTorch implementation of End-to-End Neural Diarization
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
espnet/espnet
End-to-End Speech Processing Toolkit
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding