speech-embeddings
There are 6 repositories under the speech-embeddings topic.
usc-sail/gen-dmcca
Generalized Deep Multiset Canonical Correlation Analysis for Multiview Learning of Speech Representations
jvel07/wav2vec2_patho
Fine-tuning wav2vec2 for Pathological Speech Processing
jvel07/dnn_embeddings_pytorch
DNN embedding extraction from audio and speech recordings using PyTorch.
peter-yh-wu/cross-lingual
Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity
epistoteles/predicting-speaker-quality
This repository belongs to my Bachelor's thesis on predicting voice likability from pre-trained speech embeddings.
NN-Project-1/dis-Vector-Embedding
The Dis-Vector project enhances voice conversion and synthesis through disentangled embeddings, allowing for high-quality, zero-shot voice cloning across multiple languages. This model leverages separate encoders for content, pitch, rhythm, and timbre, enabling precise control over synthesized voice characteristics.