shantanuj's Stars
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
facebookresearch/ConvNeXt
Code release for ConvNeXt model
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
KaihuaTang/Scene-Graph-Benchmark.pytorch
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiased Scene Graph Generation from Biased Training CVPR 2020”
ashkamath/mdetr
clip-vil/CLIP-ViL
[ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383
facebookresearch/mvit
Code Release for MViTv2 on Image Recognition.
cvlab-stonybrook/LearningToCountEverything
davidmascharka/tbd-nets
PyTorch implementation of "Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning"
KaihuaTang/VQA2.0-Recent-Approachs-2018.pytorch
A PyTorch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures", "Learning to count object", "Bottom-up top-down" for Visual Question Answering 2.0
ucbdrive/skipnet
Code for SkipNet: Learning Dynamic Routing in Convolutional Networks (ECCV 2018)
Tushar-N/blockdrop
BlockDrop: Dynamic Inference Paths in Residual Networks
soyeonm/FILM
Official repository of ICLR 2022 paper FILM: Following Instructions in Language with Modular Methods
ronghanghu/lcgn
Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning, ICCV 2019
mods333/energy-based-scene-graph
Code release for Energy-Based Learning for Scene Graph Generation
SHTUPLUS/PySGG
The toolkit for scene graph generation
aykutfirat/pyTorchTree
Recursive Neural Network and Tree LSTM implementations in pyTorch for sentiment analysis
google-deepmind/dm_fast_mapping
tomerwolgithub/Break
daqingliu/NMTree
Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)
CAS-CLab/DGNet
Dynamic Dual Gating Neural Networks, ICCV 2021
timqqt/pytorch-zoom-in-network
Source code for "Efficient Classification of Very Large Images with Tiny Objects"
ronghanghu/gqa_single_hop_baseline
A simple but well-performing "single-hop" visual attention model for the GQA dataset
irwinherrmann/stochastic-gates
Codebase for "Channel selection using Gumbel Softmax"
aurooj/WSG-VQA-VLTransformers
Weakly Supervised Grounding for VQA in Vision-Language Transformers
AntoineSimoulin/pytree
Implementation of tree-structured neural networks in PyTorch.
ceyzaguirre4/DACT-MAC
Repository for hosting the code for the CVPR 2020 paper Differentiable Adaptive Computation Time for Visual Reasoning.
jingchenchen/ReasoningConsistency-VQA
manoja328/tallyqacode
Official Code for "TallyQA: Answering Complex Counting Questions" published at AAAI 2018