Pinned Repositories
code-for-Lucas-Kanade-20-Years-On
kaldi-native-fbank
Kaldi-compatible online fbank extractor without external dependencies
kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
onnxruntime-libs
onnxruntime pre-compiled libs
OpenCNN
An Open Convolutional Neural Network Framework in C++ From Scratch
optimized_transducer
Memory efficient transducer loss computation
transducer-loss-benchmarking
sherpa
Speech-to-text server framework with next-gen Kaldi
sherpa-ncnn
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.
sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages
csukuangfj's Repositories
csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
csukuangfj/onnxruntime-libs
onnxruntime pre-compiled libs
csukuangfj/kaldi-native-fbank
Kaldi-compatible online fbank extractor without external dependencies
csukuangfj/PortAudioSharp2
C# binding for portaudio supporting Linux, macOS, Windows, iOS
csukuangfj/kaldilm
Python wrapper for kaldi's arpa2fst
csukuangfj/kaldi-hmm-gmm
csukuangfj/onnxruntime-build
A build project for ONNX Runtime
csukuangfj/sherpa-onnx
csukuangfj/icefall
csukuangfj/naudiodon2
Node.js stream bindings for PortAudio
csukuangfj/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
csukuangfj/sherpa
Speech-to-text server framework with next-gen Kaldi
csukuangfj/sherpa-ncnn
csukuangfj/tts-dataset-creation
Create dataset for tts
csukuangfj/k2
FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar
csukuangfj/kaldialign
Python wrappers for Kaldi Levenshtein's distance and alignment code.
csukuangfj/kaldifst
Python wrapper for OpenFST and its extensions from Kaldi
csukuangfj/mecat
csukuangfj/piper
A fast, local neural text to speech system
csukuangfj/piper-phonemize
C++ library for converting text to phonemes for Piper
csukuangfj/500lines
500 Lines or Less
csukuangfj/colab
Colab notebooks for Next-gen Kaldi
csukuangfj/kaldi-decoder
Decoders from Kaldi using OpenFst
csukuangfj/lilcom
Small compression utility
csukuangfj/mlx
MLX: An array framework for Apple silicon
csukuangfj/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
csukuangfj/pnnx
PyTorch Neural Network eXchange
csukuangfj/sherpa-mlx
sherpa with mlx
csukuangfj/test-github-actions
csukuangfj/voxpopuli
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation