Pinned Repositories
AS-SGG
bottom_up_features_extract
A PyTorch reimplementation of bottom-up-attention models
COC
Code for ACMMM2024 paper COC.
COCOAPI_Visualization
cuda-cpp-c-compile
FLAN
Code for PR2024 paper FLAN.
GraphVQA
GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering
HP
Code for BMVC2024 paper HP.
Priv_Labelimg
A labelimg tool that includes box, segmentation, instance keypoint, brush, and human part features.
Privision
ZHUXUHAN's Repositories
ZHUXUHAN/COC
Code for ACMMM2024 paper COC.
ZHUXUHAN/FLAN
Code for PR2024 paper FLAN.
ZHUXUHAN/HP
Code for BMVC2024 paper HP.
ZHUXUHAN/VQA2.0-Recent-Approachs-2018.pytorch
A PyTorch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures", "Learning to count object", "Bottom-up top-down" for Visual Question Answering 2.0
ZHUXUHAN/AS-SGG
ZHUXUHAN/BagofTricks-LT
A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results
ZHUXUHAN/BalancedGroupSoftmax
CVPR 2020 oral paper: Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax.
ZHUXUHAN/cfvqa
[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias
ZHUXUHAN/classifier-balancing
This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020
ZHUXUHAN/CLIP-Guided-Diffusion
Just playing with getting CLIP Guided Diffusion running locally, rather than having to use colab.
ZHUXUHAN/Coacher
ZHUXUHAN/cvpods
All-in-one Toolbox for Computer Vision Research.
ZHUXUHAN/detr
End-to-End Object Detection with Transformers
ZHUXUHAN/DRG
[ECCV 2020] DRG: Dual Relation Graph for Human-Object Interaction Detection
ZHUXUHAN/eql.detectron2
The official implementation of Equalization Loss for Long-Tailed Object Recognition (CVPR 2020) based on Detectron2. https://arxiv.org/abs/2003.05176
ZHUXUHAN/Game-Programmer-Study-Notes
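:anchor: A collection of reading notes from my career as a game programmer. You can think of it as an enhanced blog. It covers graphics, real-time rendering, programming practice, GPU programming, design patterns, software engineering, and more. Keep Reading, Keep Writing, Keep Coding.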
ZHUXUHAN/HOI-CL
Series of work (ECCV2020, CVPR2021, CVPR2021) about Compositional Learning for Human-Object Interaction Detection
ZHUXUHAN/ImgaeCaption.pytorch
ImgaeCaption.pytorch
ZHUXUHAN/migs
MIGS: Meta Image Generation from Scene Graphs (BMVC 2021)
ZHUXUHAN/ML-CV
Machine Learning in Action
ZHUXUHAN/Multi-Modal-Transformer
This repository collects various multi-modal transformer architectures, including image transformers, video transformers, image-language transformers, and video-language transformers, along with related datasets. It also collects many useful tutorials and tools in these domains.
ZHUXUHAN/my_segmentation
ZHUXUHAN/Oscar
Oscar and VinVL
ZHUXUHAN/segmentation-sg
Code Release for the paper Segmentation Grounded Scene Graph Generation
ZHUXUHAN/TextBlob
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
ZHUXUHAN/Transferable-Interactiveness-Network
Code for Transferable Interactiveness Knowledge for Human-Object Interaction Detection. (CVPR'19, TPAMI'21)
ZHUXUHAN/Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
ZHUXUHAN/Transformer_model
A pytorch implementation of Attention Is All You Need (Transformer) for image captioning.
ZHUXUHAN/upt
Official PyTorch implementation for paper "Efficient Two-Stage Detection of Human–Object Interactions with a Novel Unary–Pairwise Transformer"
ZHUXUHAN/yolo_pytorch