Deepfake-Audio

👉🏻 An audio deepfake is when a “cloned” voice that is potentially indistinguishable from the real person’s is used to produce synthetic audio.

Google Colaboratory
Kaggle
Model
Project Demo

This project is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time.

SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generate new voices.

👉🏻 Created to Learn the working of Deepfake-Audio 👈🏻

✌🏻 Back To Repository ✌🏻

DotWasi/DEEPFAKE-AUDIO

Deepfake-Audio