wav2vec
There are 24 repositories under wav2vec topic.
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
mailong25/self-supervised-speech-recognition
speech to text with self-supervised learning based on wav2vec 2.0 framework
oliverguhr/wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
arxyzan/data2vec-pytorch
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
shangeth/SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf
loretoparisi/wave2vec-recognize-docker
Wave2vec 2.0 Recognize pipeline
robinhad/voice-recognition-ua
Training scripts for Speech-To-Text models for Ukrainian language
lucasgris/wav2vec4bp
Wav2vec resources and models for Brazilian Portuguese
bhattbhavesh91/wav2vec2-huggingface-demo
Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer
daanzu/wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition
notAI-tech/IndicASR
Speeech Recognition for Indic languages.
jvel07/wav2vec2_patho
Fine-tuning wav2vec2 to for Pathological Speech Processing
phanxuanphucnd/wav2asr
A library version of wav2vec 2.0 framework for Automatic Speech Recognition task.
manhph2211/Speech-Processing
Building a speaker identification & verification pipeline for Vietnamese voices :sleepy:
NabinAdhikari674/wav2vec
A repo to make installation and training of a wav2vec model easier
oswaldoludwig/Pruning-pre-trained-models-using-evolutionary-computation
This repository contains scripts to prune Wav2vec2 using a neuroevolution-based method. More details about this method can be found in the paper Compressing Wav2vec2 for Embedded Applications.
abdur75648/DINet-Inference
Create high-resolution visually dubbed videos with DINet
MarwaAbdelAal/ASR-correction-model
ASR model generates transcription from audio waves, then correct the word spelling
mead-ml/audio8
Deep audio modeling
dpigasin/speech_recognition
Recognition of a medical diagnosis from speech.
hciays/ailab_ss2022
asr for German Language
Katashynskyi/Voice_assistant_UA_EN
No api-keys | local | llama3.1 For language studying and live translation