piplyman

Pinned Repositories

Algorithm_Interview_Notes-Chinese
2018/2019/Campus Recruiting/Spring Recruiting/Autumn Recruiting/Algorithms/Machine Learning/Deep Learning/Natural Language Processing (NLP)/C/C++/Python/Interview Notes
Language:Python0 1 00
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python0 0 00
ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
Language:Python0 1 00
awesome-automl-papers
A curated list of automated machine learning papers, articles, tutorials, slides and projects
0 1 00
Awesome-Speech-Language-Model
Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.
00
ba-dls-deepspeech
Language:Python0 1 00
basic-yolo-keras
Easy training on custom datasets. Various backends (MobileNet and SqueezeNet) supported. A YOLO demo that detects raccoons and runs entirely in the browser is accessible at https://git.io/vF7vI (not on Windows).
Language:Jupyter Notebook00
ChatTTS
A generative speech model for daily dialogue.
Language:Python0 0 00
coding-interview-university
A complete computer science study plan to become a software engineer.
0 1 00
DeepSpeech
A PaddlePaddle implementation of DeepSpeech2 architecture for ASR.
Language:Python0 1 00

piplyman's Repositories

piplyman/Algorithm_Interview_Notes-Chinese
2018/2019/Campus Recruiting/Spring Recruiting/Autumn Recruiting/Algorithms/Machine Learning/Deep Learning/Natural Language Processing (NLP)/C/C++/Python/Interview Notes
Language:Python0 1 00
piplyman/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python0 0 00
piplyman/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
Language:Python0 1 00
piplyman/awesome-automl-papers
A curated list of automated machine learning papers, articles, tutorials, slides and projects
0 1 00
piplyman/Awesome-Speech-Language-Model
Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.
00
piplyman/ChatTTS
A generative speech model for daily dialogue.
Language:Python0 0 00
piplyman/coding-interview-university
A complete computer science study plan to become a software engineer.
0 1 00
piplyman/DeepSpeech
A PaddlePaddle implementation of DeepSpeech2 architecture for ASR.
Language:Python0 1 00
piplyman/DeepSpeech-1
A TensorFlow implementation of Baidu's DeepSpeech architecture
Language:C++0 1 00
piplyman/listen-attend-and-spell
TensorFlow implementation of "Listen, Attend and Spell" authored by William Chan. This project uses TensorFlow's input pipeline and Estimator API, which makes training and evaluation truly end-to-end.
Language:Python0 1 00
piplyman/fish-speech
Brand new TTS solution
Language:Python0 0
piplyman/lossless-cut
The Swiss Army knife of lossless video/audio editing
piplyman/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
Language:Python0 0
piplyman/masr
Chinese speech recognition with pretrained models and high recognition accuracy; Mandarin Automatic Speech Recognition
Language:Python
piplyman/MDDM
Language:HTML1 0
piplyman/models
Pre-trained and Reproduced Deep Learning Models (reproductions of classic models)
Language:Python1 0
piplyman/nabu
Code for end-to-end ASR with neural networks, built with TensorFlow
Language:Python
piplyman/Online-Speech-Recognition
Working online speech recognition based on RNN Transducer. (Trained model release soon ...)
piplyman/pumpkin-book
Formula derivations and explanations for the book Machine Learning (the "Watermelon Book"); read online at https://datawhalechina.github.io/pumpkin-book
1 0
piplyman/pymixconsole
Headless multitrack mixing console in Python
Language:Python0 0
piplyman/snac
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
piplyman/so-vits-svc
SoftVC VITS Singing Voice Conversion
piplyman/soundstorm
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
piplyman/SoundStorm-pytorch
Google's SoundStorm: Efficient Parallel Audio Generation
piplyman/SoundStorm-yangdongchao
The reproduced code for Google's SoundStorm
piplyman/spear-tts-pytorch
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch
piplyman/transformer
A TensorFlow Implementation of the Transformer: Attention Is All You Need
Language:Python
piplyman/UnivHD
Language:HTML
piplyman/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Language:Python0 0
piplyman/video-subtitle-extractor
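Extracts hard-coded subtitles from videos and generates srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that includes subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files.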
Language:Python0 0