actuy

actuy's Stars

CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:Python53.2k 941 1.1k8.8k
jhao104/proxy_pool
Python ProxyPool for web spider
Language:Python21.8k 445 6175.2k
soumith/ganhacks
starter from "How to Train a GAN?" at NIPS2016
11.5k 345 691.7k
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Language:Python8.5k 156 5441.1k
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Language:Python7.9k 183 2921.9k
openai/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
Language:Python7.9k 300 2621.4k
keithito/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Language:Python3k 147 323958
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Language:Python2.3k 134 472906
bytedance/GiantMIDI-Piano
Language:Python1.7k 24 11180
bytedance/piano_transcription
Language:Python1.7k 26 33202
lowerquality/gentle
gentle forced aligner
Language:Python1.5k 45 237297
MontrealCorpusTools/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Language:Python1.4k 35 726251
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
1.3k 57 199141
YannickJadoul/Parselmouth
Praat in Python, the Pythonic way
Language:C++1.1k 22 77117
guoday/Tencent2020_Rank1st
The code for 2020 Tencent College Algorithm Contest, and the online result ranks 1st.
Language:Python1k 18 18317
Tonejs/Midi
Convert MIDI into Tone.js-friendly JSON
Language:TypeScript901 28 105120
yy1lab/Lyrics-Conditioned-Neural-Melody-Generation
Language:Jupyter Notebook425 14 1261
santi-pdp/segan_pytorch
Speech Enhancement Generative Adversarial Network in PyTorch
Language:Python383 12 35110
HLTSingapore/Emotional-Speech-Data
This is the GitHub page for publicly available emotional speech data.
329 5 224
music-x-lab/POP909-Dataset
This is the dataset repository for the paper: POP909: A Pop-song Dataset for Music Arrangement Generation
Language:Python299 5 941
danmic/av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
205 12 222
hsinyuan-huang/FlowQA
Implementation of conversational QA model: FlowQA (with slight improvement)
Language:Python197 9 2657
JusperLee/Looking-to-Listen-at-the-Cocktail-Party
Executable code based on Google articles
Language:Python165 7 2245
JeremyCCHsu/vqvae-speech
Tensorflow implementation of the speech model described in Neural Discrete Representation Learning (a.k.a. VQ-VAE)
Language:Python128 10 631
iamyuanchung/speech2vec-pretrained-vectors
Speech2vec pre-trained word vectors
77 2 411
DDMAL/jSymbolic2
2nd Version of jSymbolic
Language:Java31 13 393
cifkao/ismir2019-music-style-translation
The code for the ISMIR 2019 paper “Supervised symbolic music style translation using synthetic data”.
Language:Python27 6 36
meelement/noise_adversarial_tacotron
Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization
Language:Jupyter Notebook17 1 06
isl-mt/fluent-fisher
14 4 01
eastonYi/Unsupervised-ASR
unsupervised ASR (mainly phone classifier) using EODM and GAN
Language:Python12 1 04