wav2vec

There are 25 repositories under wav2vec topic.

s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language:Python2.5k 44 407513
mailong25/self-supervised-speech-recognition
speech to text with self-supervised learning based on wav2vec 2.0 framework
Language:Python383 13 66117
oliverguhr/wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Language:Python367 7 1658
arxyzan/data2vec-pytorch
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
Language:Python182 3 2025
shangeth/SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf
Language:Python66 3 922
robinhad/voice-recognition-ua
Training scripts for Speech-To-Text models for Ukrainian language
Language:Jupyter Notebook38 4 72
lucasgris/wav2vec4bp
Wav2vec resources and models for Brazilian Portuguese
Language:Jupyter Notebook34 2 32
loretoparisi/wave2vec-recognize-docker
Wave2vec 2.0 Recognize pipeline
Language:Python33 8 610
bhattbhavesh91/wav2vec2-huggingface-demo
Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer
Language:Jupyter Notebook30 2 214
daanzu/wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition
Language:Python24 5 13
notAI-tech/IndicASR
Speeech Recognition for Indic languages.
Language:Python13 4 13
jvel07/wav2vec2_patho
Fine-tuning wav2vec2 to for Pathological Speech Processing
Language:Jupyter Notebook6 1 11
phanxuanphucnd/wav2asr
A library version of wav2vec 2.0 framework for Automatic Speech Recognition task.
Language:Python4 1 04
thisisHJLee/Fine-Tuning-of-XLSR-Wav2Vec2-on-Korean
Language:Jupyter Notebook4 1 01
abdur75648/DINet-Inference
Create high-resolution visually dubbed videos with DINet
Language:Python3 1 0
Katashynskyi/Voice_assistant_UA_EN
No api-keys | local | llama3.1 For language studying and live translation
Language:Python3 1 11
manhph2211/DSP101
Building a speaker identification & verification pipeline for Vietnamese voices :sleepy:
Language:Jupyter Notebook3 1 1
NabinAdhikari674/wav2vec
A repo to make installation and training of a wav2vec model easier
Language:Python2 1 00
oswaldoludwig/Pruning-pre-trained-models-using-evolutionary-computation
This repository contains scripts to prune Wav2vec2 using a neuroevolution-based method. More details about this method can be found in the paper Compressing Wav2vec2 for Embedded Applications.
Language:Shell2 1 01
MarwaAbdelAal/ASR-correction-model
ASR model generates transcription from audio waves, then correct the word spelling
Language:Python1 1 00
mead-ml/audio8
Deep audio modeling
Language:Python1 2 10
mradovic38/voice-command-recognition
Smart home controller simulator, receiving voice commands from a microphone.
Language:Jupyter Notebook1 1 01
hciays/ailab_ss2022
asr for German Language
Language:Python0 1 00
jkyl/vq-vae
A JAX / NNX implementation of a VQ-VAE for audio compression
Language:Jupyter Notebook0 1 00
Natalia-T/NeurIPS2021
Language:Python0 0 00

wav2vec

s3prl/s3prl

mailong25/self-supervised-speech-recognition

oliverguhr/wav2vec2-live

arxyzan/data2vec-pytorch

shangeth/SpeakerProfiling

robinhad/voice-recognition-ua

lucasgris/wav2vec4bp

loretoparisi/wave2vec-recognize-docker

bhattbhavesh91/wav2vec2-huggingface-demo

daanzu/wav2vec2_stt_python

notAI-tech/IndicASR

jvel07/wav2vec2_patho

phanxuanphucnd/wav2asr

thisisHJLee/Fine-Tuning-of-XLSR-Wav2Vec2-on-Korean

abdur75648/DINet-Inference

Katashynskyi/Voice_assistant_UA_EN

manhph2211/DSP101

NabinAdhikari674/wav2vec

oswaldoludwig/Pruning-pre-trained-models-using-evolutionary-computation

MarwaAbdelAal/ASR-correction-model

mead-ml/audio8

mradovic38/voice-command-recognition

hciays/ailab_ss2022

jkyl/vq-vae

Natalia-T/NeurIPS2021