Pinned Repositories
artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
asr-e2e
Automatic speech recognition system using end-to-end approach for Russian speech
ASRT_SpeechRecognition
基于深度学习的语音识别系统
chinese-poetry
最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
chinese_text_normalization
Chinese text normalization for speech processing
interview
notes about questions raised by interviwers and some notes about java,linux shell and javascripts
LPCTron
Tacotron2 + LPCNET for complete End-to-End TTS System
marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
speech2singing
Implementation of speech to singing of interspeech20' paper.
superhg2012's Repositories
superhg2012/chinese-poetry
最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
superhg2012/chinese_text_normalization
Chinese text normalization for speech processing
superhg2012/speech2singing
Implementation of speech to singing of interspeech20' paper.
superhg2012/chatbot-list
行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍
superhg2012/clause
:horse_racing: Chatopera语义理解系统
superhg2012/Deep-Learning-in-Production
In this repository, I will share some useful notes and references about deploying deep learning-based models in production.
superhg2012/DeepRL
【深度强化学习社区】一个资料与学习内容最全的服务平台
superhg2012/DurIAN
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
superhg2012/g2pC
g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
superhg2012/g2pM
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
superhg2012/GAN-TTS
A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS
superhg2012/GPT2-chitchat
GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型
superhg2012/kaldiio
A pure python module for reading and writing kaldi ark files
superhg2012/melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2)
superhg2012/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
superhg2012/OpenASR
A pytorch based end2end speech recognition system.
superhg2012/pycorrector
pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.
superhg2012/Python
All Algorithms implemented in Python
superhg2012/pytorch-handbook
pytorch handbook是一本开源的书籍,目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门,其中包含的Pytorch教程全部通过测试保证可以成功运行
superhg2012/pytorch-onnx-tensorflow-pb
Converting A PyTorch Model to Tensorflow pb using ONNX
superhg2012/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
superhg2012/rnn-transducer
A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition
superhg2012/speedyspeech
superhg2012/tacotron2-melgan
Mel spectrum based on tacotron2 for melgan speech synthesis
superhg2012/tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
superhg2012/TensorflowTTS
:stuck_out_tongue_closed_eyes: TensorflowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2
superhg2012/TensorRT-Inference-Server-Tutorial
服务侧深度学习部署案例
superhg2012/TTS-papers
Collection of Text to Speech papers.
superhg2012/uWebSockets
Simple, secure & standards compliant web I/O for the most demanding of applications
superhg2012/wavegan
WaveGAN: Learn to synthesize raw audio with generative adversarial networks