ha-ov's Stars
ssarfraz/FINCH-Clustering
Source Code for FINCH Clustering Algorithm
itwanger/toBeBetterJavaer
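An accessible, humorous Java learning guide covering core topics such as Java fundamentals, Java concurrent programming, the Java virtual machine, Java enterprise development, and Java interviews. To learn Java, look to Er Ge's Java Advancement Path 😄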
NiceSeason/gulimall-learning
2020 Gulimall code and notes
lilishop/lilishop
Online mall: Java e-commerce mall, multi-language mall, uniapp mall, microservices mall
luo3300612/Visualizer
assistant tools for attention visualization in deep learning
Snailclimb/JavaGuide
"Java Learning + Interview Guide": covers the core knowledge most Java programmers need to master. Preparing for a Java interview? JavaGuide is the first choice!
FutureTwT/BSTH
The source code of "Bit-aware Semantic Transformer Hashing for Multi-modal Retrieval." (Accepted by SIGIR 2022)
amusi/CVPR2024-Papers-with-Code
A collection of CVPR 2024 papers and open-source projects
sangyun884/HR-VITON
Official PyTorch implementation for the paper High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions (ECCV 2022).
WangGodder/deep-cross-modal-hashing
Deep learning cross modal hashing in PyTorch
BMC-SDNU/Hashing-Retrieval
Cross-Modal-Hashing-Retrieval/Multi-Modal-Hashing-Retrieval
bannedbook/fanqiang
Firewall circumvention: unrestricted internet access
yoshitomo-matsubara/torchdistill
A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc. are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring the reproducibility and benchmark.
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
zhouyu1996/COCO4CMH
This repository is created to handle the original MS COCO-2014 dataset for cross-modal retrieval task.
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
microsoft/GLIP
Grounded Language-Image Pre-training
Paranioar/Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model, Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
ioanacroi/qb-norm
Cross Modal Retrieval with Querybank Normalisation
khoadoan106/single_loss_quantization
rdevon/DIM
Deep InfoMax (DIM), or "Learning Deep Representations by Mutual Information Estimation and Maximization"
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
LgQu/CAMERA
Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20
KunpengLi1994/VSRN
PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching"
zengyan-97/X-VLM
X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)
peteanderson80/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
megvii-research/ML-GCN
PyTorch implementation of Multi-Label Image Recognition with Graph Convolutional Networks, CVPR 2019.
jiangqy/DCMH-CVPR2017
source code for paper "Deep Cross-Modal Hashing"
LgQu/DIME
Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21