Pinned Repositories
3DDFA
The improved PyTorch version of the TPAMI 2017 paper: Face Alignment in Full Pose Range: A 3D Total Solution.
500lines
500 Lines or Less
ABINet
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
AdvSemiSeg
PyTorch implementation of the paper "Adversarial Learning for Semi-supervised Semantic Segmentation," Wei-Chih Hung, Yi-Hsuan Tsai, Yan-Ting Liou, Yen-Yu Lin, and Ming-Hsuan Yang
pytorch-fbs
PyTorch implementation of "Dynamic Channel Pruning: Feature Boosting and Suppression"
triton-inference-server
The Triton Inference Server provides a cloud inferencing solution optimized for NVIDIA GPUs.
VMZ
R(2+1)D and Mixed-Convolutions for Action Recognition.
wenhuach's Repositories
wenhuach/triton-inference-server
The Triton Inference Server provides a cloud inferencing solution optimized for NVIDIA GPUs.
wenhuach/ABINet
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
wenhuach/avbert
wenhuach/AVID-CMA
Audio Visual Instance Discrimination with Cross-Modal Agreement
wenhuach/Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
wenhuach/CPM
Introduction to CPM
wenhuach/DynaVSR
DynaVSR: Dynamic Adaptive Blind Video Super-Resolution
wenhuach/EssentialMC2
EssentialMC2 Video Understanding.
wenhuach/ffmpeg-libav-tutorial
FFmpeg libav tutorial - learn how media works from basic to transmuxing, transcoding and more
wenhuach/first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
wenhuach/GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
wenhuach/HAWQ
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
wenhuach/mediapipe
MediaPipe is the simplest way for researchers and developers to build world-class ML solutions and applications for mobile, edge, cloud and the web.
wenhuach/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
wenhuach/mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
wenhuach/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
wenhuach/nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
wenhuach/nni
An open source AutoML toolkit to automate the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
wenhuach/One-Shot_Free-View_Neural_Talking_Head_Synthesis
wenhuach/openvino_training_extensions
Trainable models and NN optimization tools
wenhuach/pensieve
Neural Adaptive Video Streaming with Pensieve (SIGCOMM '17)
wenhuach/pytorch-lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
wenhuach/smplify-x
Expressive Body Capture: 3D Hands, Face, and Body from a Single Image
wenhuach/smplpix
SMPLpix: Neural Avatars from 3D Human Models
wenhuach/TalkingHead-1KH
wenhuach/Ultra-Light-Fast-Generic-Face-Detector-1MB
💎 1MB lightweight face detection model
wenhuach/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
wenhuach/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
wenhuach/youtube-dl
A fork of youtube-dl, for archival purposes.
wenhuach/YOWO
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization