Pinned Repositories
AugLy
A data augmentations library for audio, image, text, and video.
awesome-kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
awesome-speech-resources
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
countword
用python实现统计多种文本单词,汉字,以及常用的格式转换
cpp_new_features
2021年最新整理, C++ 学习资料,含C++ 11 / 14 / 17 / 20 / 23 新特性、入门教程、推荐书籍、优质文章、学习笔记、教学视频等
k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
machine_learning
机器学习的相关知识总结
onlinemusic
java,web,jsp,在线音乐系统
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
xiexukang's Repositories
xiexukang/AugLy
A data augmentations library for audio, image, text, and video.
xiexukang/futurecoder
A platform for beginners to learn programming in Python
xiexukang/ACGPN
"Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content",CVPR 2020. (Modified from original with fixes for inference)
xiexukang/android-reverse
安卓逆向工具汇总 / Awsome Android Reverse Tools
xiexukang/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
xiexukang/EasyEspnet
Making Espnet easier to use
xiexukang/edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
xiexukang/EfficientConformer
Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
xiexukang/End-to-end-ASR-Pytorch
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.
xiexukang/exkaldi-rt
An online speech recognition extension toolkit of Kaldi
xiexukang/Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
xiexukang/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
xiexukang/mandarin-tts
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets
xiexukang/ML_Notes
机器学习算法的公式推导以及numpy实现
xiexukang/models
Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)
xiexukang/neural_sp
End-to-end ASR/LM implementation with PyTorch
xiexukang/OpenNMT-py
Open Source Neural Machine Translation in PyTorch
xiexukang/openspeech
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
xiexukang/powerful_chinese_ASR
facebook 开源的 wav2vec模型. 确实很sota
xiexukang/Pytorch_to_onnx
The summary about pytorch code to onnx
xiexukang/self-supervised-speech-recognition
speech to text with self-supervised learning based on wav2vec 2.0 framework
xiexukang/speech_tools
xiexukang/speechbrain
A PyTorch-based Speech Toolkit
xiexukang/Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
xiexukang/ttskit
语音合成工具箱,Text To Speech Toolkit,多种音色可供选择的语音合成工具。
xiexukang/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
xiexukang/voxceleb_trainer
In defence of metric learning for speaker recognition
xiexukang/wenet
Transformer based ASR Engine.
xiexukang/xiexukang
Config files for my GitHub profile.
xiexukang/zhrtvc
Chinese real time voice cloning (VC) and Chinese text to speech (TTS). 好用的中文语音克隆兼中文语音合成系统,包含语音编码器、语音合成器、声码器和可视化模块。