uzkent's Stars
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
conwnet/github1s
One second to read GitHub code with VS Code.
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
openai/DALL-E
PyTorch package for the discrete VAE used for DALL·E.
google-research/vision_transformer
lukemelas/EfficientNet-PyTorch
A PyTorch implementation of EfficientNet
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
facebookresearch/deit
Official DeiT repository
jcjohnson/cnn-benchmarks
Benchmarks for popular CNN models
whai362/PVT
Official implementation of PVT series
AgaMiko/data-augmentation-review
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
lucidrains/perceiver-pytorch
Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
JudyYe/zero-shot-gcn
Zero-Shot Learning with GCN (CVPR 2018)
yaohungt/Multimodal-Transformer
[ACL'19] [PyTorch] Multimodal Transformer
ChenRocks/UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
google-research-datasets/conceptual-captions
Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image captioning systems.
lichengunc/refer
Referring Expression Datasets API
jiweibo/ImageNet
This implements training of popular model architectures, such as AlexNet, ResNet and VGG on the ImageNet dataset(Now we supported alexnet, vgg, resnet, squeezenet, densenet)
wutong16/DistributionBalancedLoss
[ ECCV 2020 Spotlight ] Pytorch implementation for "Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets"
yinboc/DGP
Rethinking Knowledge Graph Propagation for Zero-Shot Learning, in CVPR 2019
ispc-lab/SmallObjectDetectionList
List of the Papers Addressing Vision-based Small Object Detection
vmurahari3/visdial-bert
Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379
uzkent/EfficientObjectDetection
PyTorch Implementation of Efficient Object Detection in Large Images - [WACV20]
ermongroup/PatchDrop
PyTorch Implementation of `Learning to Process Fewer Pixels` - [CVPR20 (Oral)]
sustainlab-group/geography-aware-ssl
Official Repository for ICCV 2021 Paper Titled as "Geography-Aware Self-Supervised Learning"
sustainlab-group/ParcelDelineation
ermongroup/NDA
raeidsaqur/mgn
Multimodal Graph Network (MGN): Code repo, examples from the paper
sustainlab-group/mapillarygcn
Repo for Predicting Livelihood Indicators from Community-Generated Street-Level Imagery (AAAI21).