hzf-hhh's Stars
ultralytics/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
facebookresearch/Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
uber-research/UPSNet
UPSNet: A Unified Panoptic Segmentation Network
zhiqi-li/Panoptic-SegFormer
This is the official repo of Panoptic SegFormer [CVPR'22]
gyyang23/AFPN
li-xirong/cross-lingual-cap
Cross-lingual image captioning
mahaoyuHKU/pytorch-boat
This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer
tany0699/FMViT
xmu-xiaoma666/LSTNet
Towards Local Visual Modeling for Image Captioning
mrwu-mac/DIFNet
[CVPR 2022] This repository is for the paper ``DIFNet: Boosting Visual Information Flow for Image Captioning'' .
aimagelab/PMA-Net
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning. ICCV 2023
weimingboya/DFT
CrossmodalGroup/CSA-Net
Young499/image-captioning-MDSANet
Pytorch implementation of paper "Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning".
TBI805/DSNT
Dual-Spatial Normalized Transformer for Image Captioning. Engineering Applications of Artificial Intelligence.
Lieberk/Chinese-IC-Baseline
A Chinese image captioning model is implemented based on PaddlePaddle framework