TheShadow29

Current: Applied Research Scientist in Surreal team at Meta. PhD@CS USC, BTech@EE IITB, intern at PRIOR AI2, Meta AI.

MetaSunnyvale, CA, USA

Pinned Repositories

awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
1.1k 29 6102
FAI-notes
Some notes, tutorials, and some experimentation with the fast.ai library (https://github.com/fastai/fastai)
Language:Jupyter Notebook58 3 315
Ifood-challenge-2018
Trying the Ifood Challenge 2018
Language:Jupyter Notebook17 5 02
infnet-spen
TensorFlow implementation [ICLR 18] "Learning Approximate Inference Networks for Structured Prediction"
Language:Python30 4 11
research-advice-list
A compilation of research advice.
218 7 017
Video-QAP
Repository for the paper Video Question Answering with Phrases via Semantic Roles
Language:Python4 2 10
VidSitu
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
Language:Python61 2 218
visual-commonsense-pytorch
For visual commonsense model
Language:Jupyter Notebook34 3 12
vognet-pytorch
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
Language:Python67 3 87
zsgnet-pytorch
Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.07129)
Language:Python71 3 1112

TheShadow29's Repositories

TheShadow29/awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
1.1k 29 6102
TheShadow29/research-advice-list
A compilation of research advice.
218 7 017
TheShadow29/zsgnet-pytorch
Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.07129)
Language:Python71 3 1112
TheShadow29/vognet-pytorch
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
Language:Python67 3 87
TheShadow29/VidSitu
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
Language:Python61 2 218
TheShadow29/Video-QAP
Repository for the paper Video Question Answering with Phrases via Semantic Roles
Language:Python4 2 10
TheShadow29/ALBEF
Code for ALBEF: a new vision-language pre-training method
Language:Python1 0 0
TheShadow29/bert_score
BERT score for text generation
Language:Jupyter Notebook1 1 0
TheShadow29/bpycv
Computer vision utils for Blender (generate instance annoatation, depth and 6D pose by one line code)
Language:Python0 0
TheShadow29/ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
Language:Python0 0
TheShadow29/coco-caption
Language:Jupyter Notebook1 0
TheShadow29/coval
A coreference evaluation package for the CoNLL and ARRAU datasets
Language:Python1 0
TheShadow29/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Language:Python1 0
TheShadow29/dotfiles
Some of my config files
Language:Shell1 0
TheShadow29/DownloadConceptualCaptions
Reliably download millions of images efficiently
Language:Jupyter Notebook0 0
TheShadow29/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python0 0
TheShadow29/fast-stable-diffusion
fast-stable-diffusion, +25-50% speed increase + memory efficient + DreamBooth
Language:Python1 0
TheShadow29/GMED
Source code for "Gradient Based Memory Editing for Task-Free Continual Learning", 4th Lifelong ML Workshop@ICML 2020
Language:Python0 0
TheShadow29/manim
A community-maintained Python framework for creating mathematical animations.
Language:Python0 0
TheShadow29/manim-pptx
Language:Python1 0
TheShadow29/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Language:Python0 0
TheShadow29/mnemonics
PyTorch implementation of "Mnemonics Training: Multi-Class Incremental Learning without Forgetting" (CVPR2020 Oral)
2 0
TheShadow29/neptune-mlflow
Neptune integration with MLflow
Language:Python0 0
TheShadow29/pycls
Codebase for Image Classification Research, written in PyTorch.
Language:Python1 0
TheShadow29/pytorchvideo
A deep learning library for video understanding research.
Language:Python0 0
TheShadow29/raiv-task
Repository to hold dataset for RAIV task
1 0
TheShadow29/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Language:Python0 0
TheShadow29/tabular_dae
Language:Python0 0
TheShadow29/USCthesis
a LaTeX style for theses and dissertations at USC
Language:TeX1 0
TheShadow29/VisTR
[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers
Language:Python0 0