hubert
There are 34 repositories under hubert topic.
voicepaw/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
lstrgar/self-supervised-phone-segmentation
Phoneme segmentation using pre-trained speech models
vectominist/MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
ECNU-Cross-Innovation-Lab/ShiftSER
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
LetianLee/Speech-Emotion-Recognition
An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning on the RAVDESS dataset.
mjhydri/Singing-Vocal-Beat-Tracking
This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trains linear multi-head self-attention layers on top of them to extract vocal beat activations. Then, it uses HMM decoder to infer signing beats and tempo.
reshalfahsi/AI-Cover-Song
Cover Song Powered by SoftVC VITS
skit-ai/Map-Mix
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at ICASSP-2023)
FinnDore/hubert
Hubert
MahdeenSky/SoftVC-VITS-MusicSingerChanger
Google collab for testing SoftVC VITS Singing Voice Conversion for AI capable of changing the singer within music files.
backspacetg/distilAlhubert
code for our paper DistilALHuBERT: A Distilled Parameter Sharing Audio Representation Model
yaya-sy/speechscorer
unsupervised spoken utterances scoring
ahmadara/Sentiment-analysis-from-human-voice-using-the-Hubert-model.
In this code, we have used common and well-known datasets such as the Toronto dataset available on Kaggle to create a sentiment analysis model from human voice. This model is designed based on the Bert model and is called Hubert.
sadPororo/UniPool-SV
Universal Pooling Method for Speaker Verification Utilizing Pre-trained Multi-layer Features, 2025 preprint
The-Hubert/The-Hubert
The home of The Hubert™️
HubertRyanOfficial/react-native-persist-context
A library to help your context being persisted in your react native apps
Rumeysakeskin/Speech-Emotion-Recognition-Turkish-and-more
Advanced Speech Emotion Recognition, based on ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets and 14 languages (Emotions: Disgust, Neutral, Kind, Anger, Surprise, Joy)
TerboucheHacene/speech-keyword-spotting
Speech Keyword detection using Wav2Vec Model
Moustachauve/PorneliusHubert.com
A website dedicated to the (fictional) original founder of Pornhub, Pornelius Hubert
aitor-alvarez/acoustic-transformer-models
Acoustic Transformer Models for Audio Classification
f-lab-edu/cat-caring-community
길고양이 커뮤니티 서비스
f-lab-edu/sago
실시간 온라인 경매 거래 플랫폼
GiovaneIwamoto/voice-cloning-bark-hubert
🐶 Voice Cloning Bark HuBERT - Enables voice cloning from personalized audio samples by processing model's outputs into semantic tokens compatible with text-to-audio system.
Youssef-Shehata/EduToons
EduToons is an innovative e-learning platform that transforms traditional teaching materials into engaging animated content using advanced AI technologies.
akash13s/audio-to-image
Pipeline for generating images conditioned on input audio
anilkeshwani/speech-text-alignment
Functionality for speech data processing including time alignment, encoding with speech encoders (tokenizers) and data preprocessing of common datasets
educarrascov/Diplo_BigData
Repositorio creado para almacenar archivos, script y el informe final del curso de modelamiento estadístico del Diplomado en Big Data de la Pontificia Universidad Católica de Chile.
hallowshaw/Vox-Debate
VoxDebate is an AI-powered debate platform built on the MERN stack, featuring voice input, real-time sentiment analysis with Hugging Face, and intelligent responses via Google Gemini Pro. With a sleek design using Tailwind CSS and shadcn-ui, it offers a dynamic, responsive, and engaging experience for meaningful discussions.
Mredulraj/Team-7
Speech_Processing_Project
omkar-nitsure/Accent-Adaptation-Codebooks
This repository contains different approaches I tried for improving ASR systems for accented English speech. All of them use the HuBERT model as baseline
ShinHyun-soo/fake-voice-detection
Fake Voice Detection using Hubert (top 20%)