Pinned Repositories
Build-CNN-or-LSTM-or-CNNLSTM-with-speech-features
A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LSTM models with those features.
CLIP
Contrastive Language-Image Pretraining
conv_arithmetic
A technical report on convolution arithmetic in the context of deep learning
end2end_ASR
端到端语音识别实现;包含LAS、CTC、RNNT解码方式,模型SA(MHA)、LSTM、CNN、DFSMN等
face-detection-pytorch
PyTorch implementations of various algorithms.
face-evaluation
Face evaluation method, such as FDDB, WIDERFace, Megaface, etc.
FaceDistance
A face recognition algorithm with the distance detector.
guided-filter
Implementation of Guided Image Filtering.
HRNet-Facial-Landmark-Detection
This is an official implementation of facial landmark detection for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919
Indoor-Pet-Detection
A demo project for indoor pet detection using Unity generated synthetic data.
chn91127's Repositories
chn91127/Build-CNN-or-LSTM-or-CNNLSTM-with-speech-features
A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LSTM models with those features.
chn91127/CLIP
Contrastive Language-Image Pretraining
chn91127/conv_arithmetic
A technical report on convolution arithmetic in the context of deep learning
chn91127/end2end_ASR
端到端语音识别实现;包含LAS、CTC、RNNT解码方式,模型SA(MHA)、LSTM、CNN、DFSMN等
chn91127/face-detection-pytorch
PyTorch implementations of various algorithms.
chn91127/face-evaluation
Face evaluation method, such as FDDB, WIDERFace, Megaface, etc.
chn91127/FaceDistance
A face recognition algorithm with the distance detector.
chn91127/guided-filter
Implementation of Guided Image Filtering.
chn91127/HRNet-Facial-Landmark-Detection
This is an official implementation of facial landmark detection for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919
chn91127/Indoor-Pet-Detection
A demo project for indoor pet detection using Unity generated synthetic data.
chn91127/Keep-Learning
The record of what I‘ve been through.
chn91127/LAS_Mandarin_PyTorch
Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)
chn91127/LBPH-vs-LMDeP
it is a comparitive study between various face feature extraction method.
chn91127/metaq-research
我对metaq的研究
chn91127/mica-speech-activity-detection
Robust Speech Activity Detection (SAD) in movie audio
chn91127/PnP_Solver
Personal implementations of solvers for PnP problem, including DLT and EPnP.
chn91127/pnp_solver-1
chn91127/speech-music-detection
Python framework for Speech and Music Detection using Keras.
chn91127/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
chn91127/VTuber-Python-Unity
An Implementation of VTuber (Both 3D and Live2D) using Python and Unity. Providing face movement tracking, eye blinking detection, iris detection and tracking and mouth movement tracking using CPU only.
chn91127/wavelet-monodepth
[CVPR 2021] Monocular depth estimation using wavelets for efficiency