ha-ov

ha-ov's Stars

WendellGul/DCMH
PyTorch implementation for paper "Deep Cross-Modal Hashing"
Language:Python10933
BruceW91/CVSE
The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)
Language:Python17019
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Language:Jupyter Notebook4.7k624
jina-ai/clip-as-service
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
Language:Python12.4k2.1k
kuanghuei/SCAN
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
Language:Python544113
google-research/vision_transformer
Language:Jupyter Notebook10.2k1.3k
dk-liang/Awesome-Visual-Transformer
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
3.4k395
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Language:Python20k3k
amzn/image-to-recipe-transformers
Code for CVPR 2021 paper: Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning
Language:Python8124
dandelin/ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
Language:Python1.4k209
XMUNLP/Tagger
Deep Semantic Role Labeling with Self-Attention
Language:Python30586
Atmegal/DGCPN
Deep Graph-neighbor Coherence Preserving Network for Unsupervised Cross-modal Hashing
Language:Python3011
zhouyu1996/DAQN
An implement of our paper “DEEP ADVERSARIAL QUANTIZATION NETWORK FOR CROSS-MODAL RETRIEVAL”
Language:Python103
shivram1987/VisionTransformerHashing
Language:Python348
lukemelas/PyTorch-Pretrained-ViT
Vision Transformer (ViT) in PyTorch
Language:Python779124
facebookresearch/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Language:Python7.2k1.2k
sksq96/pytorch-summary
Model summary in PyTorch similar to `model.summary()` in Keras
Language:Python4k413
cocodataset/cocoapi
COCO API - Dataset @ http://cocodataset.org/
Language:Jupyter Notebook6.1k3.8k
WZMIAOMIAO/deep-learning-for-image-processing
deep learning for image processing including classification and object-detection etc.
Language:Python22.6k7.9k
akshitac8/BiAM
[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages
Language:Python5912
KaiserLew/JDSH
Joint-modal Distribution-based Similarity Hashing for Large-scale Unsupervised Deep Cross-modal Retrieval
Language:Python207
zs-zhong/DJSRH
The code for Deep Joint-Semantics Reconstructing Hashing for Large-Scale Unsupervised Cross-Modal Retrieval (ICCV 2019)
Language:Python7317
Huyp777/CMHN
Cross-Modal Hashing for Efﬁciently Retrieving Moments in Videos
Language:Python71
uta-smile/TCL
code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022
Language:Python25833
MIT-LCP/wfdb-python
Native Python WFDB package
Language:Jupyter Notebook738300
TorchSSL/TorchSSL
A PyTorch-based library for semi-supervised learning (NeurIPS'21)
Language:Python1.3k186
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language:Python31.7k4.7k
yitu-opensource/T2T-ViT
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Language:Jupyter Notebook1.1k177