Pinned Repositories
ActionDetection-AFSD
Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"
ActionDetection-DBG
activitygraph_transformer
babel
Deep learning model for single-cell inference of multi-omic profiles from a single input modality.
BE
[CVPR2021] The source code for our paper "Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning".
big_transfer
Official repository for the "Big Transfer (BiT): General Visual Representation Learning" paper.
BKinD
Behavioral Keypoint Discovery
BMN-Boundary-Matching-Network
A PyTorch implementation of the paper "BMN: Boundary-Matching Network for Temporal Action Proposal Generation", accepted at ICCV 2019.
Bottom-Up-TAL-with-MR
Implementation for Bottom-Up Temporal Action Localization with Mutual Regularization (ECCV2020)
CenseoQoE
Image and video quality assessment
WellXiong's Repositories
WellXiong/ActionDetection-AFSD
Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"
WellXiong/activitygraph_transformer
WellXiong/babel
Deep learning model for single-cell inference of multi-omic profiles from a single input modality.
WellXiong/BE
[CVPR2021] The source code for our paper "Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning".
WellXiong/big_transfer
Official repository for the "Big Transfer (BiT): General Visual Representation Learning" paper.
WellXiong/BKinD
Behavioral Keypoint Discovery
WellXiong/Bottom-Up-TAL-with-MR
Implementation for Bottom-Up Temporal Action Localization with Mutual Regularization (ECCV2020)
WellXiong/CenseoQoE
Image and video quality assessment
WellXiong/CPT
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation
WellXiong/deit
Official DeiT repository
WellXiong/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
WellXiong/google_trans_new
A free and unlimited python API for google translate.
WellXiong/gtad
The official implementation of G-TAD: Sub-Graph Localization for Temporal Action Detection
WellXiong/LGI4temporalgrounding
Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"
WellXiong/PaddleOCR2Pytorch
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
WellXiong/PGT
WellXiong/pren
Code for "Primitive Representation Learning for Scene Text Recognition" (CVPR 2021)
WellXiong/PRTR
PRTR: Pose Recognition with Cascade Transformers
WellXiong/PVT
WellXiong/SceneSeg
Codebase for the CVPR 2020 paper "A Local-to-Global Approach to Multi-modal Movie Scene Segmentation"
WellXiong/Sentence-VAE
PyTorch Re-Implementation of "Generating Sentences from a Continuous Space" by Bowman et al 2015 https://arxiv.org/abs/1511.06349
WellXiong/STAM-An-Image-is-Worth-16x16-Words-What-is-a-Video-Worth-
Space-Time Attention Model
WellXiong/Stark
Learning Spatio-Temporal Transformer for Visual Tracking
WellXiong/TDN
[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
WellXiong/TimeSformer
The official PyTorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
WellXiong/TransTrack
Multiple-Object Tracking with Transformer
WellXiong/vision_transformer
WellXiong/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
WellXiong/ViT-pytorch-1
PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
WellXiong/VSR-Transformer
VSR-Transformer