Pinned Repositories
CJK-character-scrape
Using httplib2 and CC-CEDICT Chinese-English dictionary, retrieve all chinese character text in a file alongside its definitions and places the result into a CSV file.
Emotion-Classification-Ravdess
Understanding emotions with Neural Networks (Python, Scikit-Learn, Keras) and the Ravdess dataset.
espnet
End-to-End Speech Processing Toolkit
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
taiwanese-tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
taiwanese_tonal_tlpa_tacotron2
voice-vector
A deep neural network for finding text-independent speaker embedding written in tensorflow
wavetomidi
to make a wave file to a standard midi file , using stft.
Whisper-Finetune
微调Whisper语音识别模型,支持无时间戳数据训练,有时间戳数据训练、无语音数据训练。加速推理,支持Web部署、Windows桌面部署和Android部署
whisper-hakka
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
yfliao's Repositories
yfliao/Emotion-Classification-Ravdess
Understanding emotions with Neural Networks (Python, Scikit-Learn, Keras) and the Ravdess dataset.
yfliao/voice-vector
A deep neural network for finding text-independent speaker embedding written in tensorflow
yfliao/Auditory_model_based_MMSE_estimator
A denoising algorithm/ Speech Enhancement method
yfliao/ChatterBot
ChatterBot is a machine learning, conversational dialog engine for creating chat bots
yfliao/Code-for-MPELU
Code for Improving Deep Neural Network with Multiple Parametric Exponential Linear Units
yfliao/compare_gan
Compare GAN code.
yfliao/DuReader
Baseline Systems of DuReader Dataset
yfliao/graphd
The Metaweb graph repository server
yfliao/HyperLPR
基于深度学习高性能中文车牌识别 High Performance Chinese License Plate Recognition Framework.
yfliao/librosa
Python library for audio and music analysis
yfliao/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
yfliao/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
yfliao/overview_optalgs
yfliao/Practical_DL
DL course co-developed by HSE, YSDA and Skoltech
yfliao/pykaldi
A Python wrapper for Kaldi
yfliao/PythonDataScienceHandbook
Python Data Science Handbook: full text in Jupyter Notebooks
yfliao/PytorchWaveNetVocoder
WaveNet-Vocoder implementation with pytorch
yfliao/pywordseg
Open Source State-of-the-art Chinese Word Segmentation System with BiLSTM and ELMo. https://arxiv.org/abs/1901.05816
yfliao/SparkNG
MATLAB realtime/interactive speech tools
yfliao/Speech-Emotion-Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
yfliao/speech-keras
Code for the blog post "Simple Audio Classification with Keras"
yfliao/SpeechDenoisingDNN
Removing various types of noises present in the speech using Deep Neural Networks
yfliao/sprocket
Voice Conversion Tool Kit
yfliao/style2paints
sketch + style = paints :art:
yfliao/tf2_course
Notebooks for my "Deep Learning with TensorFlow 2 and Keras" course
yfliao/theMLbook
The Python code to reproduce the illustrations from The Hundred-Page Machine Learning Book.
yfliao/TTS-Tools
Some useless thing for Speech Synthesis System
yfliao/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
yfliao/Wave-U-Net-For-Speech-Enhancement
Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemented for the task of speech enhancement in the time-domain.
yfliao/YANGstraight_source
Wavelet-based source information analysis for YANGstraight and real-time interactive tools