Pinned Repositories
audino
Open source audio annotation tool for humans™
awesome-speech
A treasure-house of speech resources
chinese_text_normalization
Chinese text normalization for speech processing
ChineseGLUE
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus and leaderboard
CLUENER2020
CLUENER2020: Fine-Grained Named Entity Recognition for Chinese
CLUEPretrainedModels
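A collection of high-quality Chinese pre-trained models: state-of-the-art large models, the fastest small models, and models specialized for similarity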
CycleGAN-VC2
Voice Conversion by CycleGAN (voice cloning/voice conversion): CycleGAN-VC2
deep-learning-drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
speech
speech-learning
xbsdsongnan's Repositories
xbsdsongnan/Dive-into-DL-PyTorch
This project ports the MXNet implementations in the original book Dive into Deep Learning to PyTorch.
xbsdsongnan/CTCResources
xbsdsongnan/Electron-SIMGUI
A code plagiarism detection tool built with Electron and Element UI; its core uses SIM (a code plagiarism detection program developed by Dick Grune)
xbsdsongnan/TensorFlowASR
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
xbsdsongnan/espnet
End-to-End Speech Processing Toolkit
xbsdsongnan/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
xbsdsongnan/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
xbsdsongnan/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
xbsdsongnan/free-spoken-digit-dataset
A free audio dataset of spoken digits. Think MNIST for audio.
xbsdsongnan/zamia-speech
Open tools and data for cloudless automatic speech recognition
xbsdsongnan/kaldi-model-server
Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone
xbsdsongnan/audino
Open source audio annotation tool for humans™
xbsdsongnan/deep-learning-drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
xbsdsongnan/pretrained-models
Open Language Pre-trained Model Zoo
xbsdsongnan/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
xbsdsongnan/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
xbsdsongnan/TransformerTTS
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
xbsdsongnan/zhrtvc
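Chinese real-time voice cloning (VC) and Chinese text to speech (TTS). An easy-to-use Chinese voice cloning and speech synthesis system, including a speech encoder, a speech synthesizer, a vocoder, and a visualization module.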
xbsdsongnan/BCsiNet
This is an implementation of BCsiNet for results reproduction on COST2100
xbsdsongnan/type-script_Chinese_character_realizing
Chinese character font recognition and classification
xbsdsongnan/chinese_text_normalization
Chinese text normalization for speech processing
xbsdsongnan/Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
xbsdsongnan/wenet
Transformer based ASR Engine.
xbsdsongnan/CycleGAN-VC2
Voice Conversion by CycleGAN (voice cloning/voice conversion): CycleGAN-VC2
xbsdsongnan/KaldiService
Service for easy access to the speech recognition capabilities of Kaldi through a REST API. Simple deployment and usage in a couple of clicks with Docker containers. Currently supports Russian. Models for other languages can easily be added if needed.
xbsdsongnan/phkit
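Phoneme toolkit. An easy-to-use phoneme processing toolbox, including modules for Chinese phonemes, English phonemes, text-to-pinyin conversion, and text normalization.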
xbsdsongnan/Deep-Learning-with-PyTorch-Chinese
This repository translates the official PyTorch book Deep Learning with PyTorch (essential excerpts edition) into Chinese and provides runnable accompanying code.
xbsdsongnan/aukit
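Audio toolkit. An easy-to-use speech processing toolbox, including modules for speech denoising, audio format conversion, and spectrogram feature generation.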
xbsdsongnan/Python24
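A collection of self-study Python materials gathered from the web, including a complete set of code and lecture notes. According to the author, these are the most recent publicly available video tutorials that can be found online to date, and no other up-to-date complete Python video course is available. The unencrypted MP4 video tutorials and lecture notes can be found in the updated Readme file; once downloaded, they play directly. The project runs from a beginner-level Python tutorial through deep learning, 30 chapters in total, and includes the Aircraft Battle game project from the Python basics section, a WSGI project, the Flask Xinjing News project, and a Django e-commerce project (the Meiduo Mall project that should have been included uses Vue, so it was replaced with the Django Tiantian Fresh project), among others. I hope this helps everyone. Gathering these resources took considerable effort, and I am glad if they help you. Your support is appreciated; if you like this repository, remember to Star it.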
xbsdsongnan/zhvoice
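Chinese voice corpus. A Chinese speech corpus with clearer, more natural speech, comprising 8 open-source datasets, 3,200 speakers, 900 hours of speech, and 13 million characters.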