Pinned Repositories
AS-SGG
bottom_up_features_extract
An PyTorch reimplementation of bottom-up-attention models
COC
Code for ACMMM2024 paper COC.
COCOAPI_Visualization
cuda-cpp-c-compile
FLAN
Code for PR2024 paper FLAN.
GraphVQA
GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering
HP
Code for BMVC2024 paper HP.
Priv_Labelimg
A labelimg tool concludes box,segmentation,instance keypoints, brush, human part features.
Privision
ZHUXUHAN's Repositories
ZHUXUHAN/bottom_up_features_extract
An PyTorch reimplementation of bottom-up-attention models
ZHUXUHAN/Python-Tools
ZHUXUHAN/a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
ZHUXUHAN/Awesome-Visual-Captioning
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
ZHUXUHAN/cihp
ZHUXUHAN/DALI
A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications
ZHUXUHAN/DaNet-3DHumanReconstruction
[TPAMI20] Learning 3D Human Shape and Pose from Dense Body Parts
ZHUXUHAN/Facial-Landmark-Detection
facial landmark detection
ZHUXUHAN/Grad-CAM.pytorch
pytorch实现Grad-CAM和Grad-CAM++,可以可视化任意分类网络的Class Activation Map (CAM)图,包括自定义的网络;同时也实现了目标检测faster r-cnn和retinanet两个网络的CAM图;欢迎试用、关注并反馈问题...
ZHUXUHAN/HOTR
Official repository for HOTR: End-to-End Human-Object Interaction Detection with Transformers (CVPR'21, Oral Presentation)
ZHUXUHAN/humanani
This code is for pose-guided human animation from a single image.
ZHUXUHAN/image-captioning
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
ZHUXUHAN/Image-Captions
BERT + Image Captioning
ZHUXUHAN/ImageCaptioning.pytorch
I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)
ZHUXUHAN/Markdown-Resume-Template
BAT程序员自己的简历模板分享出来了 。技术简历追求简单明了,避免没有必要的花哨修饰,大家可以fork到自己仓库中,基于这个模板进行修改。
ZHUXUHAN/mxnet_backbone
ZHUXUHAN/OmniNet
Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain
ZHUXUHAN/One-shot-Human-Parsing
(AAAI 2021) Progressive One-shot Human Parsing
ZHUXUHAN/OneNet
What Makes for End-to-End Object Detection, ICML2021
ZHUXUHAN/openvqa
A lightweight, scalable, and general framework for visual question answering research
ZHUXUHAN/ovr-cnn
A new framework for open-vocabulary object detection, based on maskrcnn-benchmark
ZHUXUHAN/PMFNet
Implementation of "Pose-aware Multi-level Feature Network for Human Object Interaction Detection"(ICCV 2019 Oral)
ZHUXUHAN/RelationNet2
RelationNet++: Bridging Visual Representations for Object Detection via Transformer Decoder
ZHUXUHAN/SCAN
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
ZHUXUHAN/Scene-Graph-Benchmark.pytorch
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiased Scene Graph Generation from Biased Training CVPR 2020”
ZHUXUHAN/SG2Caps
ZHUXUHAN/SMPL
NumPy, TensorFlow and PyTorch implementation of human body SMPL model and infant body SMIL model.
ZHUXUHAN/UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
ZHUXUHAN/visualization
a collection of visualization function
ZHUXUHAN/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch