Pinned Repositories
CJK-character-scrape
Using httplib2 and CC-CEDICT Chinese-English dictionary, retrieve all chinese character text in a file alongside its definitions and places the result into a CSV file.
Emotion-Classification-Ravdess
Understanding emotions with Neural Networks (Python, Scikit-Learn, Keras) and the Ravdess dataset.
espnet
End-to-End Speech Processing Toolkit
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
taiwanese-tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
taiwanese_tonal_tlpa_tacotron2
voice-vector
A deep neural network for finding text-independent speaker embedding written in tensorflow
wavetomidi
to make a wave file to a standard midi file , using stft.
Whisper-Finetune
微调Whisper语音识别模型,支持无时间戳数据训练,有时间戳数据训练、无语音数据训练。加速推理,支持Web部署、Windows桌面部署和Android部署
whisper-hakka
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
yfliao's Repositories
yfliao/Alibaba-MIT-Speech
Alibaba speech technology
yfliao/ANPR-Tensorflow
Using neural networks to build an automatic number plate recognition system.
yfliao/ASR_course
ASR course at Chula 2018
yfliao/bilm-tf
Tensorflow implementation of contextualized word representations from bi-directional language models
yfliao/ChhoeTaigi
ChhoeTaigi 找台語
yfliao/cn-text-normalizer
A python module that convert chinese written string to read string. 一个python包:将中文书面字符串转换为口语字符串。
yfliao/Convert-ppt-to-pdf
AppleScript to convert doc/ppt to pdf
yfliao/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
yfliao/deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
yfliao/ECSD
E-Commerce Sentiment Dict
yfliao/exercise
exercise for nndl
yfliao/finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
yfliao/Hierarchical-Recurrent-Neural-Networks-for-Speech-Bandwidth-Extension
Codes of the paper: * Zhen-Hua Ling , Yang Ai, Yu Gu, and Li-Rong Dai, "Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 26, no. 5, pp. 883-894, 2018.
yfliao/kaldi-yesno-tutorial
Tutorial on Kaldi for Brandeis ASR course
yfliao/Keras-GAN
Keras implementations of Generative Adversarial Networks.
yfliao/melosynth
Synthesize a continuous pitch sequence
yfliao/multi-task-kaldi
An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 training.
yfliao/Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
yfliao/pytorch-openai-transformer-lm
A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI
yfliao/REAPER
yfliao/RMDL
RMDL: Random Multimodel Deep Learning for Classification
yfliao/segan
Speech Enhancement Generative Adversarial Network in TensorFlow
yfliao/speech-denoising-wavenet
A neural network for end-to-end speech denoising
yfliao/Speech-to-Midi
A spectrum analyzer with a MIDI output and a color-coded display for developing music
yfliao/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model
yfliao/tacotron-1
tacotron for research on Chinese speech synthesis and Taiwanese speech synthesis from Chinese input text sequence with different granularities
yfliao/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
yfliao/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
yfliao/wavenet_vocoder
WaveNet vocoder
yfliao/WhirlwindTourOfPython
The Jupyter Notebooks behind my OReilly report, "A Whirlwind Tour of Python"