Pinned Repositories
add-noise
add noise of a certain SNR to audio files
aiyinyue
alexa-printer-backend
Service that bridges voice assistants with IoT clients over AMQP.
android-webrtc-vad
webrtc-vad(单独抽取webrtc中的vad模块,编译成so库移植android平台使用)
ApproxMVBB
Fast algorithms to compute an approximation of the minimal volume oriented bounding box of a point cloud in 3D.
ASR_WORD
采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。
ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
awesome-3D-vision
3D computer vision incuding SLAM,VSALM,Deep Learning,Structured light,Stereo,Three-dimensional reconstruction,Computer vision,Machine Learning and so on
PRIDNet
Code for the paper "Pyramid Real Image Denoising Network"
Speaker_Verification
Academic Project for Speech and Speaker Recognition course DD2119
MingmChen's Repositories
MingmChen/awesome-3D-vision
3D computer vision incuding SLAM,VSALM,Deep Learning,Structured light,Stereo,Three-dimensional reconstruction,Computer vision,Machine Learning and so on
MingmChen/aiyinyue
MingmChen/ApproxMVBB
Fast algorithms to compute an approximation of the minimal volume oriented bounding box of a point cloud in 3D.
MingmChen/AudioDemo-1
Audio API demo on Android platform
MingmChen/chhRobotics_CPP
自动驾驶规划控制常用算法c++代码实现
MingmChen/cmake-demo
《CMake入门实战》源码
MingmChen/DCRNN
Implementation of Diffusion Convolutional Recurrent Neural Network in Tensorflow
MingmChen/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
MingmChen/Dual-Microphone-Noise-Reduction-by-PLD-Technique
Working on a dual-microphone noise reduction for mobile phone in noisy environment by Power Level Different Technique (PLD).
MingmChen/face
一个具有人脸识别、人脸对比、人脸/人体追踪、真人鉴别、图像质量检测的综合智能安防系统
MingmChen/FreeCAD-0.19_pre
MingmChen/GuidedDenoising
Guided Mesh Normal Filtering
MingmChen/Maixduino
Arduino port on Maix board ( k210 )
MingmChen/MeshSimplification
A mesh simplification algorithm by shrinking faces
MingmChen/mnn_example
alibaba MNN, mobilenet classifier, centerface detecter, ultraface detecter, pfld landmarker and zqlandmarker, mobilefacenet
MingmChen/MS-SNSD
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.
MingmChen/nnom
A higher-level Neural Network library for microcontrollers.
MingmChen/OpenGL-development-tour
MingmChen/PaddleX
PaddlePaddle Entire Process Development Toolkit(『飞桨』深度学习全流程开发工具)
MingmChen/PJCurvature
Calculate the curvature of discrete points
MingmChen/Point-Cloud-Processing-example
点云库PCL从入门到精通 书中配套案例
MingmChen/Realtime_AudioDenoise_EchoCancellation
MingmChen/slam_in_autonomous_driving
《自动驾驶中的SLAM技术》对应开源代码
MingmChen/slam_in_autonomous_driving_change
高博新书《自动驾驶与机器人中的SLAM技术》源码修改版,根据深蓝学院要求,对每一章的代码进行特定修改,以实现不同的功能。
MingmChen/slambook2
edition 2 of the slambook
MingmChen/sonic
Simple library to speed up or slow down speech
MingmChen/SoundLocation
基于pynq-z2的声源定位系统
MingmChen/TDA-ReCTS
A Validation Set for Text Detection Ambiguity
MingmChen/webrtc_android
webrtc VideoCall VideoConference 视频通话 视频会议
MingmChen/ZlwAudioRecorder
AudioRecorder: Android 录音及录音可视化相关lib,支持pcm、wav、mp3音频的录制