speech-embeddings

There are 6 repositories under the speech-embeddings topic.

usc-sail/gen-dmcca
Generalized Deep Multiset Canonical Correlation Analysis for Multiview Learning of Speech Representations
Language: Python · 12 5 06
jvel07/wav2vec2_patho
Fine-tuning wav2vec2 for Pathological Speech Processing
Language: Jupyter Notebook · 6 1 11
jvel07/dnn_embeddings_pytorch
DNN embeddings extraction from audio and speech recordings using PyTorch.
Language: Python · 2 2 01
peter-yh-wu/cross-lingual
Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity
Language: Python · 2 1 01
epistoteles/predicting-speaker-quality
This repository belongs to my Bachelor's thesis on predicting voice likability from pre-trained speech embeddings.
Language: Python · 1 2 00
NN-Project-1/dis-Vector-Embedding
The Dis-Vector project enhances voice conversion and synthesis through disentangled embeddings, allowing for high-quality, zero-shot voice cloning across multiple languages. This model leverages separate encoders for content, pitch, rhythm, and timbre, enabling precise control over synthesized voice characteristics.
Language: Python · 1 1 00