LifaSun

Andy Sun

LifaSun's Stars

pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language: Python, 85.3k stars, 23k forks
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language: Python, 137k stars, 27.4k forks
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language: Python, 30.7k stars, 6.4k forks
facebookresearch/libri-light
dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.
Language: Python, 484 stars, 78 forks
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
Language: Python, 22.7k stars, 5.5k forks
openai/DALL-E
PyTorch package for the discrete VAE used for DALL·E.
Language: Python, 10.8k stars, 1.9k forks
openai/gpt-3
GPT-3: Language Models are Few-Shot Learners
15.7k stars, 2.3k forks
zzw922cn/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
3k stars, 514 forks
google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
Language: C++, 28.1k stars, 5.2k forks
alievk/avatarify-python
Avatars for Zoom, Skype and other video-conferencing apps.
Language: Python, 16.3k stars, 4.1k forks
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language: Python, 1.9k stars, 544 forks
JasonWei512/Tacotron-2-Chinese
(Outdated) Chinese speech synthesis, adapted from https://github.com/Rayhane-mamah/Tacotron-2 and https://github.com/begeekmyfriend/Tacotron-2
Language: Python, 299 stars, 70 forks
lancopku/pkuseg-python
The pkuseg toolkit for multi-domain Chinese word segmentation
Language: Python, 6.6k stars, 988 forks
fxsjy/jieba
Jieba Chinese word segmentation
Language: Python, 33.5k stars, 6.7k forks
xiangyuecn/Recorder
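HTML5 JS audio recording in mp3, wav, ogg, webm, amr, g711a, and g711u formats; supports PC and some Android and iOS browsers, Hybrid Apps (Android and iOS app source code provided), and WeChat; provides ASR speech-to-text, an H5 voice call and chat example, and DTMF encoding and decoding
Language: JavaScript, 5k stars, 1k forks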
NVIDIA/waveglow
A Flow-based Generative Network for Speech Synthesis
Language: Python, 2.3k stars, 531 forks
descriptinc/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Language: Python, 989 stars, 215 forks
TensorSpeech/TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
Language: Python, 3.9k stars, 816 forks
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Language: Jupyter Notebook, 1.6k stars, 343 forks
espnet/espnet
End-to-End Speech Processing Toolkit
Language: Python, 8.6k stars, 2.2k forks
Hiroshiba/realtime-yukarin
An application for real-time voice conversion
Language: Python, 330 stars, 51 forks
Alexander-H-Liu/End-to-end-ASR-Pytorch
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.
Language: Python, 1.2k stars, 317 forks
QianyanTech/Image-Downloader
Download images from Google, Bing, and Baidu.
Language: Python, 2.2k stars, 576 forks
TimoBolkart/voca
This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.
Language: Python, 1.2k stars, 273 forks
dunbar12138/Audiovisual-Synthesis
Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders
Language: Python, 120 stars, 24 forks
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning
Language: Python, 600 stars, 68 forks
tristandeleu/pytorch-meta
A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch
Language: Python, 2k stars, 256 forks
sudharsan13296/Hands-On-Meta-Learning-With-Python
Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with Tensorflow
Language: Jupyter Notebook, 1.2k stars, 360 forks
learnables/learn2learn
A PyTorch Library for Meta-learning Research
Language: Python, 2.7k stars, 355 forks
wblgers/py_speech_seg
A toolkit for speech segmentation based on BIC and neural networks such as BiLSTM
Language: Python, 122 stars, 38 forks
