voice-synthesis

There are 119 repositories under voice-synthesis topic.

coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python31.4k 269 1k3.7k
jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
1.6k 43 17222
DanRuta/xVA-Synth
Machine learning based speech synthesis Electron app, with voices from specific characters from video games
Language:JavaScript585 24 4354
SforAiDl/Neural-Voice-Cloning-With-Few-Samples
This repository has implementation for "Neural Voice Cloning With Few Samples"
Language:Python424 31 22123
hujinsen/pytorch-StarGAN-VC
Fully reproduce the paper of StarGAN-VC. Stable training and Better audio quality .
Language:Python243 7 1257
ZDisket/TensorVox
Desktop application for neural speech synthesis written in C++
Language:C++208 15 920
smoke-trees/Voice-synthesis
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.
Language:Python160 5 046
zakaton/Pink-Trombone
A programmable version of Neil Thapen's Pink Trombone
Language:JavaScript155 8 528
ManimCommunity/manim-voiceover
Manim plugin for all things voiceover
Language:Python152 4 4420
JollyToday/GhostCut-auto_video_translation
auto video translation-video translator can auto translate video hard subtitles, auto video translation and dubbing, remove any video text, auto remove video subtitles/text. 自动视频翻译配音，自动翻译视频字幕和回填样式，自动硬字幕翻译。
Language:Python113 3 025
Azure-Samples/Cognitive-Services-Voice-Assistant
Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building client application for your bot or Custom Command service. You will also be able to easily deploy a working Custom Command based Voice Assistant to your own Azure subscription
Language:C++97 21 26999
sidmulajkar/sentiment-predictor-for-stress-detection
Voice stress analysis (VSA) aims to differentiate between stressed and non-stressed outputs in response to stimuli (e.g., questions posed), with high stress seen as an indication of deception. In this work, we propose a deep learning-based psychological stress detection model using speech signals. With increasing demands for communication between humans and intelligent systems, automatic stress detection is becoming an interesting research topic. Stress can be reliably detected by measuring the level of specific hormones (e.g., cortisol), but this is not a convenient method for the detection of stress in human- machine interactions. The proposed algorithm first extracts Mel- filter bank coefficients using pre-processed speech data and then predicts the status of stress output using a binary decision criterion (i.e., stressed or unstressed) using CNN (Convolutional Neural Network) and dense fully connected layer networks.
Language:Jupyter Notebook81 5 223
YuzukiTsuru/lessampler
lessampler is a Singing Voice Synthesizer
Language:C++69 9 85
RageAgainstThePixel/com.rest.elevenlabs
A non-official Eleven Labs voice synthesis client for Unity (UPM)
Language:C#68 2 317
nipponjo/tts-arabic-pytorch
TTS models for Arabic (Tacotron2, FastPitch)
Language:Jupyter Notebook66 2 1515
spokestack/spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Language:Java66 6 95
hparcells/rtvc
💬 "Realtime" voice transcription and cloning using ElevenLabs's API.
Language:TypeScript48 3 53
RageAgainstThePixel/ElevenLabs-DotNet
A Non-Official ElevenLabs RESTful API Client for dotnet
Language:C#48 4 1115
chdh/klatt-syn
Klatt formant synthesizer
Language:TypeScript44 1 03
olaviinha/NeuralTextToAudio
Text prompt steered synthetic audio generators
Language:Jupyter Notebook44 3 36
spokestack/spokestack-ios
Spokestack: give your iOS app a voice interface!
Language:Swift42 5 118
wafflecomposite/15.ai-Python-API
Python3 script for interaction with https://fifteen.ai/
Language:Python38 7 612
jim-schwoebel/nala
🦁 Nala is an agile open-source voice assistant framework (20+ actions).
Language:Python34 4 514
lyrebird-ai/lyrebird-slack-integration
Send voicified messages on Slack using your vocal avatar!
Language:JavaScript33 8 011
N3RDIUM/JARVIS
Better JARVIS, with faster and smarter responses, topped off with amazing visuals.
Language:Python27 4 32
YuzukiTsuru/SinsyPlus
Singing Voice Synthesis System based on Sinsy
Language:Python23 2 63
manhph2211/ml-deployment
Pushing Text To Speech models into production using torchserve, kubernetes and react web app :smile:
Language:Python21 3 02
shun60s/Vocal-Tube-Model
a very simple vocal tract model, few tube model. generate vowel sound by it
Language:Python18 2 03
BullShark/JSpeak
A Text to Speech Reader Front-end that Reads from the Clipboard and with Exceptionable Features
Language:Java16 3 63
VisionBrain/Neural_Voice_Cloning
Open Source Implementation of Neural Voice Cloning with Few Audio Samples (Baidu Research)
Language:Python16 3 16
Harium/espeak-java
espeak java wrapper
Language:Java15 4 38
TheShadow29/VC-with-GAN
Voice Conversion with GANs
Language:Python15 4 23
lifecompanionaac/lifecompanion
LifeCompanion is a free open-source AAC software
Language:Java12 1 3433
brycehowitson/SSML-prosody-library
A collection of pre-built speech synthesis settings used to convey emotion
11 4 03
sil-ai/tts-singlish
TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.
Language:Python9 4 03
Mohamedhany99/Voice-Frequency-Extraction-Signal-Processing-
This Script is able to extract Frequency of the voice detected in an audio file (preferred in ".wav" filetype)
Language:Python7 2 01

voice-synthesis

coqui-ai/TTS

jim-schwoebel/voice_datasets

DanRuta/xVA-Synth

SforAiDl/Neural-Voice-Cloning-With-Few-Samples

hujinsen/pytorch-StarGAN-VC

ZDisket/TensorVox

smoke-trees/Voice-synthesis

zakaton/Pink-Trombone

ManimCommunity/manim-voiceover

JollyToday/GhostCut-auto_video_translation

Azure-Samples/Cognitive-Services-Voice-Assistant

sidmulajkar/sentiment-predictor-for-stress-detection

YuzukiTsuru/lessampler

RageAgainstThePixel/com.rest.elevenlabs

nipponjo/tts-arabic-pytorch

spokestack/spokestack-android

hparcells/rtvc

RageAgainstThePixel/ElevenLabs-DotNet

chdh/klatt-syn

olaviinha/NeuralTextToAudio

spokestack/spokestack-ios

wafflecomposite/15.ai-Python-API

jim-schwoebel/nala

lyrebird-ai/lyrebird-slack-integration

N3RDIUM/JARVIS

YuzukiTsuru/SinsyPlus

manhph2211/ml-deployment

shun60s/Vocal-Tube-Model

BullShark/JSpeak

VisionBrain/Neural_Voice_Cloning

Harium/espeak-java

TheShadow29/VC-with-GAN

lifecompanionaac/lifecompanion

brycehowitson/SSML-prosody-library

sil-ai/tts-singlish

Mohamedhany99/Voice-Frequency-Extraction-Signal-Processing-