MaN0bhiR's Stars
tvhahn/PyPHM
Machinery data, made easy. Easily download and prepare common industrial datasets.
Flux9665/TTSCorpusCreator
A tool that makes creating text-to-speech corpora easier.
Vaibhavs10/insanely-fast-whisper
gabrielmittag/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
X-LANCE/VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
p1an-lin-jung/WavThruVec_pytorch
An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"
henrymaas/AudioSlicer
Audio Slicer that uses silence detection to split .wav audio files into multiple .wav samples.
innnky/diff-svc
An Implementation of Singing Voice Conversion Based on Diffsinger
pongasoft/jamba
A lightweight VST2/3 framework
juce-framework/JUCE
JUCE is an open-source cross-platform C++ application framework for desktop and mobile applications, including VST, VST3, AU, AUv3, LV2 and AAX audio plug-ins.
HaiFengZeng/expressive_tacotron
smacke/ffsubsync
Automagically synchronize subtitles with video.
itkho/music-remover
Removes the music from an audio file, so that only the voices remain
groundcat/Google-AI-video-transcribe-subtitle-generator
Transcribes video using GCP speech-to-text and generates .SRT subtitles
rafalkrol-xyz/youtube-subs
A simple project for generating subtitles for YouTube videos using GCP's Speech-to-Text API.
KunZhou9646/Speaker-independent-emotional-voice-conversion-based-on-conditional-VAW-GAN-and-CWT
This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conversion".
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
auspicious3000/AutoPST
Global Rhythm Style Transfer Without Text Transcriptions
Sindhu-Hegde/pseudo-visual-speech-denoising
Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021
adhadse/Deepdubpy
A complete end-to-end Deep Learning system to generate high quality human like speech in English for Korean Drama (WIP)
magnum79/splitAudioBySrt
Slices video files by timeframes from subtitles.
Edresson/YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
andabi/deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
xingchensong/ASR-Wavnet
some ASR-system implementations (via tensorflow 1.x)
elastic/elasticsearch-py
Official Python client for Elasticsearch
chaharnishant11/PlacementPrepGuide
Includes all the resources for Core CS fundamentals
GarvTambi/robocomp
RoboComp is a cutting-edge open-source robotics framework providing tools to easily create, modify and manage robot software components.