LifaSun's Stars
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
facebookresearch/libri-light
dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
openai/DALL-E
PyTorch package for the discrete VAE used for DALL·E.
openai/gpt-3
GPT-3: Language Models are Few-Shot Learners
zzw922cn/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
alievk/avatarify-python
Avatars for Zoom, Skype and other video-conferencing apps.
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
JasonWei512/Tacotron-2-Chinese
(已过时)中文语音合成,改自 https://github.com/Rayhane-mamah/Tacotron-2 和 https://github.com/begeekmyfriend/Tacotron-2
lancopku/pkuseg-python
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
fxsjy/jieba
结巴中文分词
xiangyuecn/Recorder
html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式,支持pc和Android、iOS部分浏览器、Hybrid App(提供Android iOS App源码)、微信,提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码
NVIDIA/waveglow
A Flow-based Generative Network for Speech Synthesis
descriptinc/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
espnet/espnet
End-to-End Speech Processing Toolkit
Hiroshiba/realtime-yukarin
An application for real-time voice conversion
Alexander-H-Liu/End-to-end-ASR-Pytorch
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.
QianyanTech/Image-Downloader
Download images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.
TimoBolkart/voca
This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.
dunbar12138/Audiovisual-Synthesis
Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning
tristandeleu/pytorch-meta
A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch
sudharsan13296/Hands-On-Meta-Learning-With-Python
Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with Tensorflow
learnables/learn2learn
A PyTorch Library for Meta-learning Research
wblgers/py_speech_seg
A toolkit to implement segmentation on speech based on BIC and nerual network, such as BiLSTM