Pinned Repositories
123sasa
2s-AGCN
Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition in CVPR19
action-faster-rcnn
ActionDetection-AFSD
Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"
AS-CAL
Augmented Skeleton Based Contrastive Action Learning with Momentum LSTM for Unsupervised Action Recognition
Bidirectional-LSTM-VAE
c3dTwostream
CaffeInstallationScript
LSTM_LIP_READING
pseudo-3d-residual-networks
Pseudo-3D Convolutional Residual Networks for Video Representation Learning
HenglinShi's Repositories
HenglinShi/Bidirectional-LSTM-VAE
HenglinShi/CaffeInstallationScript
HenglinShi/123sasa
HenglinShi/2s-AGCN
Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition in CVPR19
HenglinShi/ActionDetection-AFSD
Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"
HenglinShi/AS-CAL
Augmented Skeleton Based Contrastive Action Learning with Momentum LSTM for Unsupervised Action Recognition
HenglinShi/caffe
Caffe: a fast open framework for deep learning.
HenglinShi/dotfiles
My Rice Setup
HenglinShi/ExpansionNet_v2
Implementation code of the work "ExpansionNet v2: Block Static Expansion in fast end to end training for Image Captioning"
HenglinShi/fitlog
fitlog is a tool that helps users record logs and manage code during deep learning training
HenglinShi/frvt
Repository for the Face Recognition Vendor Test (FRVT)
HenglinShi/ghostnet
[CVPR2020] GhostNet: More Features from Cheap Operations
HenglinShi/HenglinShi.github.io
HenglinShi/meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
HenglinShi/mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
HenglinShi/MobileCompute
HenglinShi/MobileComputing
HenglinShi/moco
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
HenglinShi/MS-G3D
PyTorch implementation of "Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition", CVPR 2020 Oral
HenglinShi/PoolNet
Code for our CVPR 2019 paper "A Simple Pooling-Based Design for Real-Time Salient Object Detection"
HenglinShi/PRML
Pattern recognition and machine learning toolbox
HenglinShi/PWP
HenglinShi/SimpleNES
An NES emulator in C++
HenglinShi/SwinBERT
Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
HenglinShi/TRN-pytorch
Temporal Relation Networks
HenglinShi/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
HenglinShi/ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
HenglinShi/Visidon_Image_Translation_Test
HenglinShi/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
HenglinShi/xrcap
Azure Kinect multi-camera secure network capture/record/replay