Pinned Repositories
bird-classification
Bird Classification using VGG16-pretrained model (Caffe and Keras implementations)
caffe-segnet-cudnn5
This repository was a fork of BVLC/caffe and includes the upsample, bn, dense_image_data and softmax_with_loss (with class weighting) layers of caffe-segnet (https://github.com/alexgkendall/caffe-segnet) to run SegNet with cuDNN version 5.
EEND-vector-clustering
This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-clustering.
ISCSLP2022-CSSD-Challenge
ISCSLP2022 CSSD Challenge
kaldi
This is the official location of the Kaldi project.
kli017.github.io
Personal Blog
RNN-for-Human-Activity-Recognition-using-2D-Pose-Input
Activity Recognition from 2D pose using an LSTM RNN
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
kli017's Repositories
kli017/ISCSLP2022-CSSD-Challenge
ISCSLP2022 CSSD Challenge
kli017/kli017.github.io
Personal Blog
kli017/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
kli017/EEND-vector-clustering
This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-clustering.
kli017/kaldi
This is the official location of the Kaldi project.
kli017/RNN-for-Human-Activity-Recognition-using-2D-Pose-Input
Activity Recognition from 2D pose using an LSTM RNN
kli017/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
kli017/deeplab_v2
基于v2版本的deeplab,使用VGG16模型,在VOC2012,Pascal-context,NYU-v2等多个数据集上进行训练
kli017/espnet
End-to-End Speech Processing Toolkit
kli017/Fine_Grained_car
web demo of fine grained car model classification
kli017/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
kli017/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
kli017/HyperVID
开源移动端车型识别[Experimental] Mobile Plateform Vehicle Identification Model
kli017/kaldi-trunk
kli017/KAN-TTS
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
kli017/kli017
Personal Blog Site
kli017/langchain-ChatGLM
langchain-ChatGLM, local knowledge based ChatGLM with langchain | 基于本地知识库的 ChatGLM 问答
kli017/lexicon
lexicon for word seg and word pronunciation
kli017/libhv
🔥 比libevent、libuv更易用的网络库。A c/c++ network library for developing TCP/UDP/SSL/HTTP/WebSocket/MQTT client/server.
kli017/loguru
Python logging made (stupidly) simple
kli017/postgress-boot
Postgress With Spring boot
kli017/Printed_chinesechar_deeprecog
deep network for common used printed chinese character classification
kli017/resnet-protofiles
Caffe Protofiles for MSRA ResNet: train prototxt
kli017/SenseVoice
Multilingual Voice Understanding Model
kli017/transducer-tutorial
Example code for a neural transducer model.
kli017/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
kli017/vosk-server
WebSocket and gRPC server for speech recognition based on Kaldi and Vosk libraries
kli017/wekws_dev
Production First and Production Ready End-to-End Keyword Spotting Toolkit
kli017/wenet-online-decoder-onnx
kli017/wfrest
C++ Web Framework REST API