Pinned Repositories
CJK-character-scrape
Using httplib2 and CC-CEDICT Chinese-English dictionary, retrieve all chinese character text in a file alongside its definitions and places the result into a CSV file.
Emotion-Classification-Ravdess
Understanding emotions with Neural Networks (Python, Scikit-Learn, Keras) and the Ravdess dataset.
espnet
End-to-End Speech Processing Toolkit
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
taiwanese-tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
taiwanese_tonal_tlpa_tacotron2
voice-vector
A deep neural network for finding text-independent speaker embedding written in tensorflow
wavetomidi
to make a wave file to a standard midi file , using stft.
Whisper-Finetune
微调Whisper语音识别模型,支持无时间戳数据训练,有时间戳数据训练、无语音数据训练。加速推理,支持Web部署、Windows桌面部署和Android部署
whisper-hakka
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
yfliao's Repositories
yfliao/wavetomidi
to make a wave file to a standard midi file , using stft.
yfliao/audio_to_midi_melodia
Extract the melody from an audio file and export to MIDI
yfliao/awesome-kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
yfliao/DaCiDian
DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)
yfliao/dl-colab-notebooks
Try out deep learning models online on Google Colab
yfliao/End-to-end-ASR-Pytorch
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.
yfliao/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
yfliao/frugally-deep
Header-only library for using Keras models in C++.
yfliao/hue7jip8
台語、族語、客語的語料清單、彙整
yfliao/imgaug
Image augmentation for machine learning experiments.
yfliao/Kaldi-ASR-in-CoLab
Kaldi-ASR0in-CoLab
yfliao/KWS-Scripts
Keyword Search Recipe for Subword ASR
yfliao/libfvad
Voice activity detection (VAD) library, based on WebRTC's VAD engine
yfliao/mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
yfliao/midi2wave
Wavenet conditioned on midi for music synthesis
yfliao/ML-KWS-for-MCU
Keyword spotting on Arm Cortex-M Microcontrollers
yfliao/music-transcription
Automatic Music Transcription (AMT) experiments using MusicNet
yfliao/nndl.github.io
《神经网络与深度学习》 Neural Network and Deep Learning
yfliao/py-nltools
A collection of basic python modules for spoken natural language processing
yfliao/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
yfliao/rubberband
An audio time-stretching and pitch-shifting library and utility program.
yfliao/Speech-enhancement
Deep neural network based speech enhancement toolkit
yfliao/summary
summaries of all the papers I read
yfliao/TensorFlow-2.x-Tutorials
TensorFlow 2.x version's Tutorials and Examples, including CNN, RNN, GAN, Auto-Encoders, FasterRCNN, GPT, BERT examples, etc. TF 2.0版入门实例代码,实战教程。
yfliao/tensorflow-quantization-example
TensorFlow Quantization Example, for TensorFlow Lite
yfliao/tensorflow_lite_guide
Guide for quantization, conversation of the tensorflow model to tensorflow lite
yfliao/Text_Classification
Text Classification Algorithms: A Survey
yfliao/Wave-U-Net
Implementation of the Wave-U-Net for audio source separation
yfliao/waveglow
A Flow-based Generative Network for Speech Synthesis
yfliao/X-Punctuator
A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text without punctuation.