zw76859420

ASR技术推进

TripShanghai

Pinned Repositories

agriculture_recognition
AI挑战赛，农作物识别，已经做好数据处理，模型训练等部分，由于时间原因交由实验室师弟进行优化。
Language:Python93
ASR_Phone
以音素建模构建NN-CTC声学模型
Language:Python15 1 14
ASR_Syllable
基于卷积神经网络的语音识别声学模型的研究
Language:Python166 4 347
ASR_Theory
语音识别理论、论文和PPT
570 18 1182
ASR_WORD
采用端到端方法构建声学模型，以字为建模单元，采用DCNN-CTC网络结构。
Language:Python71 2 221
ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Language:Python146
espnet
End-to-End Speech Processing Toolkit
Language:Shell11
image-recognition
采用深度学习方法进行刀具识别。
Language:Python23 1 27
kaggle-cats-and-dogs
采用深度学习方法进行图像识别，数据集为kaggle数据集中的猫与狗数据集。
Language:Python49 1 124
video-action-recognition
视频动作识别，基于C3D网络构建
Language:Python2913

zw76859420's Repositories

zw76859420/ASR_Syllable
基于卷积神经网络的语音识别声学模型的研究
Language:Python166 4 347
zw76859420/C-_learning
C++提高（看过c++基础之后再看）黑马培训课程，自己手打，路径 https://www.bilibili.com/video/av35939892/?p=3
Language:C++2 1 01
zw76859420/bert
TensorFlow code and pre-trained models for BERT
Language:Python1 1 0
zw76859420/DNN-HMM-Course
DNN-HMM related Experiments for THUHCSI Course : <Digital Processing of Speech Signals>
Language:Python1 1 01
zw76859420/NeuralSpeech
Language:Python1 0 0
zw76859420/SparseSelfAttention
Sparse Attention Mechanism, accepted in KSC 2019
Language:Python1 1 0
zw76859420/AIF-PyTorch
(NOT Official) Implementation Auto-regressive Integrate-and-Fire (AIF)
Language:Python0 0
zw76859420/asr-decode-simple
从Kaldi中裁剪的轻量级语音识别解码推理框架，目前实现了MFCC+GMM+Viterbi，不依赖OpenFST、OpenBLAS等库
Language:C++1 0
zw76859420/athena-decoder
Language:Python1 0
zw76859420/Bayesian_TDNN
This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition"
Language:C++1 0
zw76859420/chinese-xinhua-important
:orange_book: 中华新华字典数据库。包括歇后语，成语，词语，汉字。
Language:Python1 0
zw76859420/CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
Language:Python0 0
zw76859420/cudafst
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
Language:Python0 0
zw76859420/decoder
Minimize kaldi nnet3 chain decoder
Language:C++1 0
zw76859420/DFSMN-Based-Lightweight-Speech-Enhancement
Deep Feedforward sequential memory networks(FSMN)
zw76859420/emoASR
End-to-end MOdeling of ASR (Automatic Speech Recognition)
Language:Python1 0
zw76859420/gfcc
gfcc features
Language:C++1 0
zw76859420/ksponspeech
Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.
Language:Python1 0
zw76859420/KWS_pytorch
Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM
Language:Python1 0
zw76859420/neurst
Neural end-to-end Speech Translation Toolkit
zw76859420/OpenTransformer
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
Language:Python1 0
zw76859420/pkwrap
A pytorch wrapper for LF-MMI training and parallel training in Kaldi
Language:Python1 0
zw76859420/pychain
PyTorch implementation of LF-MMI for End-to-end ASR
Language:C++1 0
zw76859420/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
Language:C++1 0
zw76859420/SimilarCharacter
对常用的6700个汉字进行音、形比较，输出音近字、形近字的列表。 # 相近字
Language:Python1 0
zw76859420/snowfall
Language:Python1 0
zw76859420/warp-rna
Recurrent Neural Aligner
Language:Python1 0
zw76859420/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:Python1 0
zw76859420/whisper
Language:Jupyter Notebook1 0
zw76859420/whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
Language:Python0 0