Pinned Repositories
CJK-character-scrape
Using httplib2 and the CC-CEDICT Chinese-English dictionary, retrieves all Chinese-character text in a file along with its definitions and writes the result to a CSV file.
Emotion-Classification-Ravdess
Understanding emotions with neural networks (Python, scikit-learn, Keras) and the RAVDESS dataset.
espnet
End-to-End Speech Processing Toolkit
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
taiwanese-tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
taiwanese_tonal_tlpa_tacotron2
voice-vector
A deep neural network for finding text-independent speaker embeddings, written in TensorFlow
wavetomidi
Converts a WAV file to a standard MIDI file using the STFT.
Whisper-Finetune
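Fine-tune the Whisper speech recognition model, with support for training on data without timestamps, data with timestamps, and data without speech. Accelerated inference, with support for Web deployment, Windows desktop deployment, and Android deployment.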
whisper-hakka
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from Hugging Face.
yfliao's Repositories
yfliao/whisper-hakka
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from Hugging Face.
yfliao/espnet
End-to-End Speech Processing Toolkit
yfliao/Whisper-Finetune
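Fine-tune the Whisper speech recognition model, with support for training on data without timestamps, data with timestamps, and data without speech. Accelerated inference, with support for Web deployment, Windows desktop deployment, and Android deployment.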
yfliao/Alpaca-CoT
We extend CoT data to Alpaca to boost its reasoning ability. We are constantly expanding our collection of instruction-tuning data and integrating more LLMs into our framework for easy use, with the aim of building a general-purpose LLM-IFT platform.
yfliao/alpaca_lora_4bit
yfliao/asr-evaluation
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
yfliao/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
yfliao/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
yfliao/batchalign
Language sample analysis tooling built around CLAN and CHAT transcripts. Dev branch is feat/next.
yfliao/byol-a
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
yfliao/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU deployment
yfliao/Chinese-Vicuna
Chinese-Vicuna: A Chinese instruction-following LLaMA-based model; a low-resource Chinese LLaMA + LoRA approach, with a structure modeled on Alpaca
yfliao/CVPR2024
CVPR 2024 Research Paper with Code
yfliao/Data-Copilot
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow
yfliao/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
yfliao/EntST
Entailment self-training
yfliao/Formosa_Hakka_Whisper
yfliao/FSR-2023-Hakka-ASR-Scoring
yfliao/GeneFace
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
yfliao/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Detects speech, music, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
yfliao/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Model for All.
yfliao/Multimodal-GPT
Multimodal-GPT
yfliao/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and can retrieve information dynamically to do so.
yfliao/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
yfliao/python-pesq
A Python package for calculating PESQ.
yfliao/taiwanese_chinese_character_tacotron2
yfliao/taiwanese_toneless_tlpa_tacotron2
yfliao/TalkLip
yfliao/whisper-finetuning
[WIP] Scripts for fine-tuning Whisper
yfliao/yt-dlp
A youtube-dl fork with additional features and fixes