Pinned Repositories
beta-tcvae
code for "Isolating Sources of Disentanglement in Variational Autoencoders".
cider
Python code for CIDEr - Consensus-based Image Caption Evaluation
coco-caption
corrective-unlearning-bench
crn
code for "Composed Image Retrieval via Cross Relation Network with Hierarchical Aggregation Transformer"
dataset-annotation-tool
A GUI annotation tool for fine-grained image-text pair datasets.
EmoLLM
EmoLLM: Multimodal Emotional Understanding Meets Large Language Models
Fewshot_Detection
Few-shot Object Detection via Feature Reweighting
IntCLIP
Repo for "Synergy of Sight and Semantics: Visual Intention Understanding with CLIP"
yan9qu's Repositories
yan9qu/EmoLLM
EmoLLM: Multimodal Emotional Understanding Meets Large Language Models
yan9qu/IntCLIP
Repo for "Synergy of Sight and Semantics: Visual Intention Understanding with CLIP"
yan9qu/crn
code for "Composed Image Retrieval via Cross Relation Network with Hierarchical Aggregation Transformer"
yan9qu/beta-tcvae
code for "Isolating Sources of Disentanglement in Variational Autoencoders".
yan9qu/cider
Python code for CIDEr - Consensus-based Image Caption Evaluation
yan9qu/coco-caption
yan9qu/corrective-unlearning-bench
yan9qu/dataset-annotation-tool
A GUI annotation tool for fine-grained image-text pair datasets.
yan9qu/Fewshot_Detection
Few-shot Object Detection via Feature Reweighting
yan9qu/MINE
yan9qu/MINE-dataset
Repo for MINE: Multimodal IntentioN and Emotion Understanding in the Wild
yan9qu/pytorch-grad-cam
Many Class Activation Map methods implemented in PyTorch for CNNs and Vision Transformers, including Grad-CAM, Grad-CAM++, Score-CAM, Ablation-CAM and XGrad-CAM
yan9qu/R2CNN_FPN_Tensorflow
R2CNN: Rotational Region CNN Based on FPN (TensorFlow)
yan9qu/ReID-Label-Noise
yan9qu/TTNet-Real-time-Analysis-System-for-Table-Tennis-Pytorch
Unofficial implementation of "TTNet: Real-time temporal and spatial video analysis of table tennis" (CVPR 2020)
yan9qu/yan9qu.github.io