Pinned Repositories
AugLy
A data augmentations library for audio, image, text, and video.
awesome-kaldi
A list of features, scripts, blogs, and resources for making better use of Kaldi (http://kaldi-asr.org/)
awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
awesome-speech-resources
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
countword
A Python tool for counting words and Chinese characters in various kinds of text, plus common format conversions
cpp_new_features
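Compiled in 2021: C++ learning materials, including new features in C++11/14/17/20/23, introductory tutorials, recommended books, quality articles, study notes, and instructional videos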
k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
machine_learning
A summary of machine learning knowledge
onlinemusic
Java, web, JSP: an online music system
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
xiexukang's Repositories
xiexukang/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
xiexukang/3D-Speaker
A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.
xiexukang/awesome-asr-contextualization
A curated list of awesome papers on contextualizing E2E ASR outputs
xiexukang/awesome-cpp
A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
xiexukang/Awesome-LLM
Awesome-LLM: a curated list of Large Language Models
xiexukang/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
xiexukang/awesome-ncnn
😎 A Collection of Awesome NCNN-based Projects
xiexukang/Cantonese-learning
Cantonese learning materials
xiexukang/ChatWaifu_Mobile
Mobile version of an anime-style AI waifu chat app
xiexukang/code-switching-papers
A curated list of research papers and resources on code-switching
xiexukang/ctc_decoder
A CTC decoder for both online and offline ASR models
xiexukang/data2vec-pytorch
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
xiexukang/espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
xiexukang/expert_readed_books
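Compiled in 2021: recommended reading for engineers, covering computer science, software technology, entrepreneurship, **, mathematics, and biographies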
xiexukang/FastASR
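A C++ implementation of ASR inference with few dependencies, simple installation, and fast inference; it also runs smoothly on ARM platforms such as the Raspberry Pi 4B. The supported models are optimized from Google's Transformer model and trained on the open-source wenetspeech dataset (10000+ hours) or Alibaba's private dataset (60000+ hours), so recognition quality is good and comparable to many commercial ASR products.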
xiexukang/FastDeploy
⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Covers 20+ mainstream scenarios across Image, Video, Text and Audio, and 150+ SOTA models, with end-to-end optimization and multi-platform, multi-framework support.
xiexukang/findpapers
Findpapers: A tool for helping researchers who are looking for related works
xiexukang/Grounded-Segment-Anything
Segment anything
xiexukang/json
JSON for Modern C++
xiexukang/keyword-spot
An end-to-end wake-word detection toolkit, from model training to model inference.
xiexukang/myblog
myblog, powered by Django and xadmin
xiexukang/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
xiexukang/OpenAI_Whisper_ASR
A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models
xiexukang/pocolm
Small language toolkit for creation, interpolation and pruning of ARPA language models
xiexukang/sherpa-ncnn
Real-time speech recognition using next-gen Kaldi with ncnn
xiexukang/torchaudio
Data manipulation and transformation for audio signal processing, powered by PyTorch
xiexukang/wenet_trt8
xiexukang/wespeaker
xiexukang/WeTextProcessing
xiexukang/whisper