hubert

There are 34 repositories under hubert topic.

voicepaw/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
Language:Python8.9k 67 3691.2k
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language:Python2.4k 44 399493
lstrgar/self-supervised-phone-segmentation
Phoneme segmentation using pre-trained speech models
Language:Python55 5 510
vectominist/MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
Language:Jupyter Notebook50 4 16
ECNU-Cross-Innovation-Lab/ShiftSER
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Language:Python36 2 22
LetianLee/Speech-Emotion-Recognition
An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning on the RAVDESS dataset.
Language:Jupyter Notebook32 2 17
mjhydri/Singing-Vocal-Beat-Tracking
This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trains linear multi-head self-attention layers on top of them to extract vocal beat activations. Then, it uses HMM decoder to infer signing beats and tempo.
Language:Python28 2 14
reshalfahsi/AI-Cover-Song
Cover Song Powered by SoftVC VITS
Language:Jupyter Notebook17 2 08
skit-ai/Map-Mix
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at ICASSP-2023)
16 3 31
FinnDore/hubert
Hubert
Language:TypeScript13 1 11
MahdeenSky/SoftVC-VITS-MusicSingerChanger
Google collab for testing SoftVC VITS Singing Voice Conversion for AI capable of changing the singer within music files.
Language:Jupyter Notebook13 2 11
backspacetg/distilAlhubert
code for our paper DistilALHuBERT: A Distilled Parameter Sharing Audio Representation Model
Language:Python6 1 00
yaya-sy/speechscorer
unsupervised spoken utterances scoring
Language:Python6 2 00
ahmadara/Sentiment-analysis-from-human-voice-using-the-Hubert-model.
In this code, we have used common and well-known datasets such as the Toronto dataset available on Kaggle to create a sentiment analysis model from human voice. This model is designed based on the Bert model and is called Hubert.
Language:Jupyter Notebook5 1 00
sadPororo/UniPool-SV
Universal Pooling Method for Speaker Verification Utilizing Pre-trained Multi-layer Features, 2025 preprint
Language:Python5 1 00
The-Hubert/The-Hubert
The home of The Hubert™️
Language:JavaScript5 3 31
HubertRyanOfficial/react-native-persist-context
A library to help your context being persisted in your react native apps
Language:Java4 1 00
Rumeysakeskin/Speech-Emotion-Recognition-Turkish-and-more
Advanced Speech Emotion Recognition, based on ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets and 14 languages (Emotions: Disgust, Neutral, Kind, Anger, Surprise, Joy)
Language:Jupyter Notebook4 1 0
TerboucheHacene/speech-keyword-spotting
Speech Keyword detection using Wav2Vec Model
Language:Python4 2 00
Moustachauve/PorneliusHubert.com
A website dedicated to the (fictional) original founder of Pornhub, Pornelius Hubert
Language:HTML2 2 01
aitor-alvarez/acoustic-transformer-models
Acoustic Transformer Models for Audio Classification
Language:Python1 1 00
f-lab-edu/cat-caring-community
길고양이 커뮤니티 서비스
Language:Java1 3 3
f-lab-edu/sago
실시간 온라인 경매 거래 플랫폼
Language:Java1 4 101
GiovaneIwamoto/voice-cloning-bark-hubert
🐶 Voice Cloning Bark HuBERT - Enables voice cloning from personalized audio samples by processing model's outputs into semantic tokens compatible with text-to-audio system.
Language:Python1 1 0
Youssef-Shehata/EduToons
EduToons is an innovative e-learning platform that transforms traditional teaching materials into engaging animated content using advanced AI technologies.
Language:JavaScript1 1 03
akash13s/audio-to-image
Pipeline for generating images conditioned on input audio
Language:Python0 0 00
anilkeshwani/speech-text-alignment
Functionality for speech data processing including time alignment, encoding with speech encoders (tokenizers) and data preprocessing of common datasets
Language:Python00
educarrascov/Diplo_BigData
Repositorio creado para almacenar archivos, script y el informe final del curso de modelamiento estadístico del Diplomado en Big Data de la Pontificia Universidad Católica de Chile.
Language:R0 1 00
f-lab-edu/reservation-delegator
Language:Java0 3 61
hallowshaw/Vox-Debate
VoxDebate is an AI-powered debate platform built on the MERN stack, featuring voice input, real-time sentiment analysis with Hugging Face, and intelligent responses via Google Gemini Pro. With a sleek design using Tailwind CSS and shadcn-ui, it offers a dynamic, responsive, and engaging experience for meaningful discussions.
Language:JavaScript0 1 00
Mredulraj/Team-7
Speech_Processing_Project
Language:Jupyter Notebook0 1 00
omkar-nitsure/Accent-Adaptation-Codebooks
This repository contains different approaches I tried for improving ASR systems for accented English speech. All of them use the HuBERT model as baseline
Language:Python00
ShinHyun-soo/fake-voice-detection
Fake Voice Detection using Hubert (top 20%)
Language:Jupyter Notebook0 1 00
f-lab-edu/geulteo
2 0

hubert

voicepaw/so-vits-svc-fork

s3prl/s3prl

lstrgar/self-supervised-phone-segmentation

vectominist/MiniASR

ECNU-Cross-Innovation-Lab/ShiftSER

LetianLee/Speech-Emotion-Recognition

mjhydri/Singing-Vocal-Beat-Tracking

reshalfahsi/AI-Cover-Song

skit-ai/Map-Mix

FinnDore/hubert

MahdeenSky/SoftVC-VITS-MusicSingerChanger

backspacetg/distilAlhubert

yaya-sy/speechscorer

ahmadara/Sentiment-analysis-from-human-voice-using-the-Hubert-model.

sadPororo/UniPool-SV

The-Hubert/The-Hubert

HubertRyanOfficial/react-native-persist-context

Rumeysakeskin/Speech-Emotion-Recognition-Turkish-and-more

TerboucheHacene/speech-keyword-spotting

Moustachauve/PorneliusHubert.com

aitor-alvarez/acoustic-transformer-models

f-lab-edu/cat-caring-community

f-lab-edu/sago

GiovaneIwamoto/voice-cloning-bark-hubert

Youssef-Shehata/EduToons

akash13s/audio-to-image

anilkeshwani/speech-text-alignment

educarrascov/Diplo_BigData

f-lab-edu/reservation-delegator

hallowshaw/Vox-Debate

Mredulraj/Team-7

omkar-nitsure/Accent-Adaptation-Codebooks

ShinHyun-soo/fake-voice-detection

f-lab-edu/geulteo