speech-embeddings

There are 6 repositories under speech-embeddings topic.

  • usc-sail/gen-dmcca

    Generalized Deep Multiset Canonical Correlation Analysis for Multiview Learning of Speech Representations

    Language:Python12506
  • jvel07/wav2vec2_patho

    Fine-tuning wav2vec2 to for Pathological Speech Processing

    Language:Jupyter Notebook6111
  • jvel07/dnn_embeddings_pytorch

    DNN embeddings extraction from audio and speech recordings using PyTorch.

    Language:Python2201
  • peter-yh-wu/cross-lingual

    Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity

    Language:Python2101
  • epistoteles/predicting-speaker-quality

    This repository belongs to my Bachelor's thesis on predicting voice likability from pre-trained speech embeddings.

    Language:Python1200
  • NN-Project-1/dis-Vector-Embedding

    The Dis-Vector project enhances voice conversion and synthesis through disentangled embeddings, allowing for high-quality, zero-shot voice cloning across multiple languages. This model leverages separate encoders for content, pitch, rhythm, and timbre, enabling precise control over synthesized voice characteristics.

    Language:Python1100