BigRedT
Research Scientist @ AI2 | Previously PhD @ UIUC and Undergrad @ IIT Kanpur
Allen Institute for Artificial IntelligenceSeattle
Pinned Repositories
codenav
CodeNav is an LLM agent that navigates and leverages previously unseen code repositories to solve user queries.
gpv-1
A task-agnostic vision-language architecture as a step towards General Purpose Vision
spoc-robot-training
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
visprog
Official code for VisProg (CVPR 2023 Best Paper!)
bottom-up-features
Bottom-up features extractor implemented in PyTorch.
DTAM
A copy of https://github.com/anuranbaka/OpenDTAM with a main cpp file for both tracking and mapping
info-ground
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
no_frills_hoi_det
A strong HOI Detection model without Frills!
vico
Multi-sense word embeddings from visual co-occurrences
Wiener_Filter
Wiener Filtering for Noise Removal in Matlab
BigRedT's Repositories
BigRedT/info-ground
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
BigRedT/no_frills_hoi_det
A strong HOI Detection model without Frills!
BigRedT/vico
Multi-sense word embeddings from visual co-occurrences
BigRedT/DTAM
A copy of https://github.com/anuranbaka/OpenDTAM with a main cpp file for both tracking and mapping
BigRedT/bottom-up-features
Bottom-up features extractor implemented in PyTorch.
BigRedT/Weighted_SGD
SGD with importance Sampling
BigRedT/crowdsource
Graphical Models and EM for Crowdsourcing
BigRedT/deep_income
BigRedT/RGBD_Segmentation
RGBD segmentation
BigRedT/nn_pred_surf
Visualize effects of neural net architectural choices on 2D data
BigRedT/pytorch-faster-rcnn
Fork of ruotianluo/pytorch-faster-rcnn with a simplified script to extract boxes, scores, features etc from any set of images and dump them in a directory
BigRedT/ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
BigRedT/AdReal_Desktop
AdReal technology for Desktop
BigRedT/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
BigRedT/Deep-learning-with-cats
Deep learning with cats (^._.^)
BigRedT/edge_boxes_with_python
A python wrapper for Edge Boxes object proposal generation
BigRedT/GenerativeImage2Text
GIT: A Generative Image-to-text Transformer for Vision and Language
BigRedT/gpv-1
A task-agnostic vision-language architecture as a step towards General Purpose Vision
BigRedT/ImageCaptioning.pytorch
Image captioning codebase in pytorch(finetunable cnn in branch "with_finetune";diverse beam search can be found in 'dbs' branch; self-critical training is under my self-critical.pytorch repository.)
BigRedT/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
BigRedT/PhotographicImageSynthesis
Photographic Image Synthesis with Cascaded Refinement Networks
BigRedT/py-faster-rcnn
Faster R-CNN (Python implementation) -- see https://github.com/ShaoqingRen/faster_rcnn for the official MATLAB version
BigRedT/pyAIUtils
Utility functions and classes for building Artificial Intelligence systems in Python
BigRedT/sfm_toolbox
BigRedT/SRL_RNN
BigRedT/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
BigRedT/tango
Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project.
BigRedT/toolbox
Piotr's Image & Video Matlab Toolbox
BigRedT/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
BigRedT/VQA