Pinned Repositories
CJK-character-scrape
Using httplib2 and the CC-CEDICT Chinese-English dictionary, retrieves all Chinese character text in a file alongside its definitions and places the result into a CSV file.
Emotion-Classification-Ravdess
Understanding emotions with Neural Networks (Python, Scikit-Learn, Keras) and the Ravdess dataset.
espnet
End-to-End Speech Processing Toolkit
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
taiwanese-tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
taiwanese_tonal_tlpa_tacotron2
voice-vector
A deep neural network for finding text-independent speaker embeddings, written in TensorFlow
wavetomidi
Converts a WAV file to a standard MIDI file using the STFT.
Whisper-Finetune
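Fine-tune the Whisper speech recognition model, with support for training on data without timestamps, data with timestamps, and data without speech. Includes accelerated inference and supports web, Windows desktop, and Android deployment.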
whisper-hakka
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
yfliao's Repositories
yfliao/taiwanese-tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
yfliao/taiwanese_tonal_tlpa_tacotron2
yfliao/arxiv-public-datasets
A set of scripts to grab public datasets from resources related to arXiv
yfliao/awesome-decision-tree-papers
A collection of research papers on decision, classification and regression trees with implementations.
yfliao/Chinese-XLNet
Pre-trained Chinese XLNet models
yfliao/css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
yfliao/DisVoice
Python framework to extract features from speech
yfliao/Dive-into-DL-PyTorch
This project converts the MXNet implementations in the original book Dive into Deep Learning into PyTorch implementations.
yfliao/DNS-Challenge
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.
yfliao/emotion-recognition-using-speech
Building and training a speech emotion recognizer that predicts human emotions using Python, scikit-learn, and Keras
yfliao/espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
yfliao/FakeLabVoice
Repository for the RJI 2020 Student Innovation Competition for detecting deepfakes
yfliao/fast-ctc-decode
Blitzing Fast CTC Beam Search Decoder
yfliao/interactive_e2e_speech_recognition
yfliao/Kaldi_ASR
yfliao/matchering-cli
🎚️ Simple Matchering 2.0 Command Line Application
yfliao/mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
yfliao/mycroft-precise
A lightweight, simple-to-use, RNN wake word listener
yfliao/myReal-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
yfliao/pase
Problem Agnostic Speech Encoder
yfliao/pythoncode-tutorials
The Python Code Tutorials
yfliao/ryAsr2020
Restarting ASR project @ 2020
yfliao/Speech-enhancement-1
Deep learning for audio denoising
yfliao/symbolic-music-datasets
🎹 Symbolic musical datasets
yfliao/tensorflow-ctc-speech-recognition
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0).
yfliao/tensorflow_lite_guide
Guide for quantization and conversion of a TensorFlow model to TensorFlow Lite
yfliao/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
yfliao/visdom
A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.
yfliao/Voice-Conversion-Detection
Automated detection of artificial voices generated by Sprocket
yfliao/Wave-U-Net-for-Speech-Enhancement-1
Implements [Wave-U-Net](https://arxiv.org/abs/1806.03185) in PyTorch and applies it to speech enhancement.