Pinned Repositories
AdaBoost
Small and easy C++ AdaBoost Implementation
Efficient_ImageNet_Classification
An efficient implementation for ImageNet classification
face_mask_tengine_lite
mtcnn-2
face detection and alignment with mtcnn
MultiModal-Active-Speaker-Detection
retinaface_pt
SCR-Face-Detection
Sample and Computation Redistribution for Efficient Face Detection
talking-face-arxiv-daily
🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.
tensorflow-tutorial-1
Some tensorflow demo
YoloOCLInference
An extremely light weight tiny-YOLO inference engine targeted towards OpenCL hardware.
azuredsky's Repositories
azuredsky/SCR-Face-Detection
Sample and Computation Redistribution for Efficient Face Detection
azuredsky/darknet
Convolutional Neural Networks
azuredsky/MODNet
A Trimap-Free Solution for Portrait Matting in Real Time under Changing Scenes
azuredsky/TalkNet_ASD
TalkNet: Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection
azuredsky/asteroid
The PyTorch-based audio source separation toolkit for researchers
azuredsky/AV-Speech-Separation-API
Code for on-demand server API that runs on Google Colab. Provides inference for audio-visual speech separation.
azuredsky/cap
azuredsky/CelebA-Spoof
[ECCV2020] A Large-Scale Face Anti-Spoofing Dataset
azuredsky/CelebAMask-HQ
A large-scale face dataset for face parsing, recognition, generation and editing.
azuredsky/CVPR-2021-Papers
azuredsky/Deep-Learning-for-Tracking-and-Detection
Collection of papers and other resources for object tracking and detection using deep learning
azuredsky/DeepFashion2
DeepFashion2 Dataset https://arxiv.org/pdf/1901.07973.pdf
azuredsky/disentangled-vae
Repository for the paper "Disentanglement Learning for Variational Autoencoders Applied to Audio-Visual Speech Enhancement".
azuredsky/DNN-based_source_separation
A PyTorch implementation of DNN-based source separation.
azuredsky/Face-landmarks-detection-benchmark
Face landmarks(fiducial points) detection benchmark
azuredsky/Keras_insightface
Insightface Keras implementation
azuredsky/Lipreading_using_Temporal_Convolutional_Networks
ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
azuredsky/Looking-to-Listen-at-the-Cocktail-Party
Executable code based on Google articles
azuredsky/PINTO_model_zoo
A repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite,.h5,.pb,saved_model,tfjs,tftrt,mlmodel,.xml/.bin, .onnx]
azuredsky/pseudo-visual-speech-denoising
Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021
azuredsky/pytorch-grad-cam
Many Class Activation Map methods implemented in Pytorch for CNNs and Vision Transformers. Including Grad-CAM, Grad-CAM++, Score-CAM, Ablation-CAM and XGrad-CAM
azuredsky/PyTorch-Quantization-Aware-Training
PyTorch Quantization Aware Training Example
azuredsky/RepPointsV2
azuredsky/Soft-DTW-Loss
PyTorch implementation of Soft-DTW: a Differentiable Loss Function for Time-Series in CUDA
azuredsky/StainGAN
azuredsky/stylegan2-ada-pytorch
StyleGAN2-ADA - Official PyTorch implementation
azuredsky/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
azuredsky/TFace
A trusty face recognition research platform developed by Tencent Youtu Lab
azuredsky/VisualVoice
Audio-Visual Speech Separation with Cross-Modal Consistency
azuredsky/yolox-keras
这是一个yolox-keras的源码,可以用于训练自己的模型。