TheShadow29
Current: Applied Research Scientist in Surreal team at Meta. PhD@CS USC, BTech@EE IITB, intern at PRIOR AI2, Meta AI.
MetaSunnyvale, CA, USA
Pinned Repositories
awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
FAI-notes
Some notes, tutorials, and some experimentation with the fast.ai library (https://github.com/fastai/fastai)
Ifood-challenge-2018
Trying the Ifood Challenge 2018
infnet-spen
TensorFlow implementation [ICLR 18] "Learning Approximate Inference Networks for Structured Prediction"
research-advice-list
A compilation of research advice.
Video-QAP
Repository for the paper Video Question Answering with Phrases via Semantic Roles
VidSitu
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
visual-commonsense-pytorch
For visual commonsense model
vognet-pytorch
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
zsgnet-pytorch
Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.07129)
TheShadow29's Repositories
TheShadow29/awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
TheShadow29/research-advice-list
A compilation of research advice.
TheShadow29/zsgnet-pytorch
Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.07129)
TheShadow29/vognet-pytorch
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
TheShadow29/VidSitu
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
TheShadow29/Video-QAP
Repository for the paper Video Question Answering with Phrases via Semantic Roles
TheShadow29/ALBEF
Code for ALBEF: a new vision-language pre-training method
TheShadow29/bert_score
BERT score for text generation
TheShadow29/bpycv
Computer vision utils for Blender (generate instance annoatation, depth and 6D pose by one line code)
TheShadow29/ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
TheShadow29/coco-caption
TheShadow29/coval
A coreference evaluation package for the CoNLL and ARRAU datasets
TheShadow29/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
TheShadow29/dotfiles
Some of my config files
TheShadow29/DownloadConceptualCaptions
Reliably download millions of images efficiently
TheShadow29/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
TheShadow29/fast-stable-diffusion
fast-stable-diffusion, +25-50% speed increase + memory efficient + DreamBooth
TheShadow29/GMED
Source code for "Gradient Based Memory Editing for Task-Free Continual Learning", 4th Lifelong ML Workshop@ICML 2020
TheShadow29/manim
A community-maintained Python framework for creating mathematical animations.
TheShadow29/manim-pptx
TheShadow29/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
TheShadow29/mnemonics
PyTorch implementation of "Mnemonics Training: Multi-Class Incremental Learning without Forgetting" (CVPR2020 Oral)
TheShadow29/neptune-mlflow
Neptune integration with MLflow
TheShadow29/pycls
Codebase for Image Classification Research, written in PyTorch.
TheShadow29/pytorchvideo
A deep learning library for video understanding research.
TheShadow29/raiv-task
Repository to hold dataset for RAIV task
TheShadow29/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
TheShadow29/tabular_dae
TheShadow29/USCthesis
a LaTeX style for theses and dissertations at USC
TheShadow29/VisTR
[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers