xiexukang

Master's student at JiangNan Universitychina

Pinned Repositories

AugLy
A data augmentations library for audio, image, text, and video.
Language:Python3 0 00
awesome-kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
00
awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
0 0 00
awesome-speech-resources
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
0 0 00
countword
用python实现统计多种文本单词，汉字，以及常用的格式转换
Language:Python1 0 00
cpp_new_features
2021年最新整理， C++ 学习资料，含C++ 11 / 14 / 17 / 20 / 23 新特性、入门教程、推荐书籍、优质文章、学习笔记、教学视频等
Language:C++0 0 00
k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Language:Cuda0 0 00
machine_learning
机器学习的相关知识总结
Language:Jupyter Notebook10
onlinemusic
java,web,jsp,在线音乐系统
20
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook0 0 00

xiexukang's Repositories

xiexukang/AugLy
A data augmentations library for audio, image, text, and video.
Language:Python3 0 00
xiexukang/futurecoder
A platform for beginners to learn programming in Python
Language:Python0 0 00
xiexukang/ACGPN
"Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content"，CVPR 2020. (Modified from original with fixes for inference)
xiexukang/android-reverse
安卓逆向工具汇总 / Awsome Android Reverse Tools
0 0
xiexukang/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
xiexukang/EasyEspnet
Making Espnet easier to use
Language:Python0 0
xiexukang/edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
xiexukang/EfficientConformer
Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
xiexukang/End-to-end-ASR-Pytorch
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.
xiexukang/exkaldi-rt
An online speech recognition extension toolkit of Kaldi
xiexukang/Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
Language:Python
xiexukang/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language:Shell0 0
xiexukang/mandarin-tts
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets
Language:Python0 0
xiexukang/ML_Notes
机器学习算法的公式推导以及numpy实现
xiexukang/models
Pre-trained and Reproduced Deep Learning Models （『飞桨』官方模型库，包含多种学术前沿和工业场景验证的深度学习模型）
xiexukang/neural_sp
End-to-end ASR/LM implementation with PyTorch
Language:Python
xiexukang/OpenNMT-py
Open Source Neural Machine Translation in PyTorch
xiexukang/openspeech
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
Language:Python0 0
xiexukang/powerful_chinese_ASR
facebook 开源的 wav2vec模型. 确实很sota
xiexukang/Pytorch_to_onnx
The summary about pytorch code to onnx
Language:Python1 01
xiexukang/self-supervised-speech-recognition
speech to text with self-supervised learning based on wav2vec 2.0 framework
xiexukang/speech_tools
1 0
xiexukang/speechbrain
A PyTorch-based Speech Toolkit
Language:Python0 0
xiexukang/Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
Language:Python0 0
xiexukang/ttskit
语音合成工具箱，Text To Speech Toolkit，多种音色可供选择的语音合成工具。
Language:Python0 0
xiexukang/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Language:C++0 0
xiexukang/voxceleb_trainer
In defence of metric learning for speaker recognition
Language:Python0 0
xiexukang/wenet
Transformer based ASR Engine.
Language:C++
xiexukang/xiexukang
Config files for my GitHub profile.
xiexukang/zhrtvc
Chinese real time voice cloning (VC) and Chinese text to speech (TTS). 好用的中文语音克隆兼中文语音合成系统，包含语音编码器、语音合成器、声码器和可视化模块。
Language:Python0 0