wav2vec
There are 25 repositories under the wav2vec topic.
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
mailong25/self-supervised-speech-recognition
Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework
oliverguhr/wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
arxyzan/data2vec-pytorch
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
shangeth/SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf
robinhad/voice-recognition-ua
Training scripts for speech-to-text models for the Ukrainian language
lucasgris/wav2vec4bp
Wav2vec resources and models for Brazilian Portuguese
loretoparisi/wave2vec-recognize-docker
Wav2vec 2.0 recognition pipeline
bhattbhavesh91/wav2vec2-huggingface-demo
Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework, using Hugging Face's Transformers
daanzu/wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition
notAI-tech/IndicASR
Speech recognition for Indic languages.
jvel07/wav2vec2_patho
Fine-tuning wav2vec2 for pathological speech processing
phanxuanphucnd/wav2asr
A library version of the wav2vec 2.0 framework for automatic speech recognition.
abdur75648/DINet-Inference
Create high-resolution visually dubbed videos with DINet
Katashynskyi/Voice_assistant_UA_EN
No API keys | local | llama3.1 | For language study and live translation
manhph2211/DSP101
Building a speaker identification & verification pipeline for Vietnamese voices
NabinAdhikari674/wav2vec
A repo to make installation and training of a wav2vec model easier
oswaldoludwig/Pruning-pre-trained-models-using-evolutionary-computation
This repository contains scripts to prune Wav2vec2 using a neuroevolution-based method. More details about this method can be found in the paper "Compressing Wav2vec2 for Embedded Applications".
MarwaAbdelAal/ASR-correction-model
An ASR model generates a transcription from audio waveforms, then corrects the word spelling
mead-ml/audio8
Deep audio modeling
mradovic38/voice-command-recognition
Smart home controller simulator, receiving voice commands from a microphone.
hciays/ailab_ss2022
ASR for the German language
jkyl/vq-vae
A JAX / NNX implementation of a VQ-VAE for audio compression