nieshaoshuai
speech enhancement, deep learning, keywords spotting, ASR
NLPR, CASIA95, East Road, Zhongguancun, Haidian, Beijing, China
Pinned Repositories
3D-convolutional-speaker-recognition
3dgan-release
A_Guide_to_Running_Tensorflow_Models_on_Android
This is the code for"A Guide to Running Tensorflow Models on Android" By SIraj Raval on Youtube
ABCNN
Implmentiaion of ABCNN(Attention-Based Convolutional Neural Network) on Tensorflow
acapellabot
Acapella Extraction with a ConvNet
aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
LSTM-MATLAB
phasen
A unofficial Pytorch implementation of Microsoft's PHASEN
Seq2seqAdapt_paper
TTS
Deep learning for Text2Speech
nieshaoshuai's Repositories
nieshaoshuai/phasen
A unofficial Pytorch implementation of Microsoft's PHASEN
nieshaoshuai/AM_TR
nieshaoshuai/at16k
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
nieshaoshuai/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
nieshaoshuai/awtk
AWTK = Toolkit AnyWhere(为嵌入式、手机和PC打造的通用GUI系统)
nieshaoshuai/bash.py
An inline Bash script runner, for Python.
nieshaoshuai/chatbot
一个可以自己进行训练的中文聊天机器人, 根据自己的语料训练出自己想要的聊天机器人,可以用于智能客服、在线问答、智能聊天等场景。加入seqGAN版本。
nieshaoshuai/chinese2digits
最好的汉字数字(中文数字)-阿拉伯数字转换工具。包含"点二八","负百分之四十"等众多汉语表达方法。NLP,机器人工程必备! The Best Tool of Chinese Number to Digits
nieshaoshuai/CNN-Inference-Engine-Quick-View
A quick view of high-performance convolution neural networks (CNNs) inference engines on mobile devices.
nieshaoshuai/datasets-CMU_Wilderness
CMU Wilderness Multilingual Speech Dataset
nieshaoshuai/distiller
A large scale study of Knowledge Distillation. NYU Computer Vision Project
nieshaoshuai/DNS-Challenge
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.
nieshaoshuai/faceswap-GAN
A denoising autoencoder + adversarial losses and attention mechanisms for face swapping.
nieshaoshuai/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication)
nieshaoshuai/hanzi_char_featurizer
汉字字符特征提取器 (featurizer),提取汉字的特征(发音特征、字形特征)用做深度学习的特征
nieshaoshuai/mace-models
Mobile AI Compute Engine Model Zoo
nieshaoshuai/nnom
A higher-level Neural Network library for microcontrollers.
nieshaoshuai/onnxruntime
ONNX Runtime
nieshaoshuai/over9000
Over9000 optimizer
nieshaoshuai/pkuseg-python
python版本:高准确度中文分词工具,简单易用,跟现有开源工具相比大幅提高了分词的准确率。
nieshaoshuai/PocketFlow
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
nieshaoshuai/project_alias
Alias is a teachable “parasite” that is designed to give users more control over their smart assistants, both when it comes to customisation and privacy. Through a simple app the user can train Alias to react on a custom wake-word/sound, and once trained, Alias can take control over your home assistant by activating it for you.
nieshaoshuai/PySimpleGUI
Launched in 2018 Actively developed and supported. Super-simple to create custom layout GUI's. Python 2.7 & 3 Support. 100 Demo programs & Cookbook for rapid start. Extensive documentation. Examples using Machine Learning(GUI, OpenCV Integration, Chatterbot), Floating Desktop Widgets, Matplotlib + Pyplot integration, add GUI to command line scripts, PDF & Image Viewer. Great for beginners as well as advanced GUI programmers..
nieshaoshuai/pytorch-distributed
A quickstart and benchmark for pytorch distributed training.
nieshaoshuai/RetinaFace-Cpp
RetinaFace detector with C++
nieshaoshuai/SceneTextPapers
Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized
nieshaoshuai/sparse_attention
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
nieshaoshuai/Speaker-Diarization
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
nieshaoshuai/syncnet_python
nieshaoshuai/voclib
A single file, public domain C implementation of a traditional vocoder.