Pinned Repositories
algorithm-realization
the realization of different machine learning and speech processing algorithms
alpaca-lora
Instruct-tune LLaMA on consumer hardware
asr-notes
平时学习工作的笔记
asr-work-mini
For my son, do asr and nlu annotation works.
chinese-asr-kaldi-and-other
Start now, first build a model for chinese from commonvoice, then use keras to build end2end model, keep updating
Interview-Notebook
:calendar: 准备秋招学习笔记
lstm_ctc_ocr
Use CTC + tensorflow to OCR
mms-alignment-tools
using MMS to do the audio-transcript alignment
snowboy
DNN based hotword and wake word detection toolkit
SupportNet
SupportNet
MXuer's Repositories
MXuer/mms-alignment-tools
using MMS to do the audio-transcript alignment
MXuer/asr-notes
平时学习工作的笔记
MXuer/asr-work-mini
For my son, do asr and nlu annotation works.
MXuer/alpaca-lora
Instruct-tune LLaMA on consumer hardware
MXuer/asr-notes-e2e
端到端语音识别相关的一些笔记
MXuer/books-notes
一些读书笔记
MXuer/documents_llama
MXuer/draw-e2e-arch
端到端语音识别模型的结构图
MXuer/Federated-learning-ASR
MXuer/fish-speech
Brand new TTS solution
MXuer/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
MXuer/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
MXuer/git-flight-rules
Flight rules for git
MXuer/icefall
MXuer/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
MXuer/Kindle_download_helper
Download all your kindle books script.
MXuer/lhotse
Tools for handling speech data in machine learning projects.
MXuer/mini-asr
code practice for asr models including las, ctc, rnn-t and others.
MXuer/notes-for-notes
记一些笔记。
MXuer/notesbooks
日常工作中用到的一些小的活,用jupyter notebook干的
MXuer/pdf2excel
pdf2excel
MXuer/pdf_tools
pdf的一些操作(提取/翻译)
MXuer/reading-paper-notes
notes for paper reading
MXuer/sft_datacollections
MXuer/speech-recognition-papers
Towards hot directions in industrial speech recognition
MXuer/speechocean762
A dataset for pronunciation scoring tasks.
MXuer/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
MXuer/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
MXuer/whisper-asr-finetune
MXuer/whisper-eval
用Whisper不同的模型,在不同语种、不同测试集上的效果。