ppengzeng's Stars
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
JDAI-CV/fast-reid
SOTA Re-identification Methods and Toolbox
apache/singa
a distributed deep learning platform
iMoonLab/HGNN
Hypergraph Neural Networks (AAAI 2019)
JDAI-CV/centerX
This repo is implemented based on detectron2 and centernet
JDAI-CV/image-captioning
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
Paranioar/SGRAF
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”
BruceW91/CVSE
The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)
xiaosu-zhu/Aurora.Music
Aurora Music
tgc1997/Awesome-Video-Captioning
A curated list of research papers in Video Captioning
xiaosu-zhu/McQuic
Repository of CVPR'22 paper "Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression"
lxc86739795/human_vehicle_parsing_platform
A pytorch codebase for human parsing and vehicle parsing
zchoi/S2-Transformer
[IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”
tgc1997/RMN
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
StephanieWyt/NMN
Source code and datasets for ACL 2020 paper: Neighborhood Matching Network for Entity Alignment.
yashkant/sam-textvqa
Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.
NovaMind-Z/PTSN
Repository for an end-to-end image captioning method PTSN(ACM MM22).
zchoi/PKOL
[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”
tzuhsial/pytorch-vqa-dan
A PyTorch implementation of Dual Attention Network
xiaosu-zhu/Aurora-Weather
Aurora Weather
ZhuGeKongKong/SGG-G2S
lixiangpengcs/Spatial-Temporal-Adaptive-Attention-for-Video-Captioning
Extension of hLSTMat
VL-Group/DPQ
brownwolf/3D-UVQ
Riesling00/HRNAT
Hierarchical Representation Network with AuxiliaryTasks for Video Captioning and Video QuestionAnswering
yyyanglz/KAN
Rich Visual Knowledge-based AugmentationNetwork for Visual Question Answering
op-multimodal/ACRTransformer
open source code for video question answering system.
VL-Group/DRQ
This is the code repository for Deep Recurrent Quantization for Generating Sequential Binary Codes
xiaosu-zhu/Dominant.Color
UWP app for palette generating