ayushchakravarthy's Stars
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
openai/CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
adam-maj/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
facebookresearch/ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
One-2-3-45/One-2-3-45
[NeurIPS 2023] Official code of "One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization"
tysam-code/hlb-CIFAR10
Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)
NVlabs/BundleSDF
[CVPR 2023] BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
ezelikman/quiet-star
Code for Quiet-STaR
dome272/MaskGIT-pytorch
PyTorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)
ContextualAI/lens
This is the official repository for the LENS (Large Language Models Enhanced to See) system.
huangwl18/language-planner
Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
mertyg/vision-language-models-are-bows
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
brendenlake/SCAN
Simple language-driven navigation tasks for studying compositional learning
brendenlake/MLC
Meta-Learning for Compositionality (MLC) for modeling human behavior
pairlab/SlotFormer
Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models
kanishkg/stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
singhgautam/slate
This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes dataset. This code is implemented in PyTorch.
addtt/object-centric-library
Library for the training and evaluation of object-centric models (ICML 2022)
i-machine-think/am-i-compositional
najoungkim/COGS
karazijal/clevrtex-generation
singhgautam/sysbinder
Official Code for Neural Systematic Binder
benpry/why-think-step-by-step
Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"
karazijal/clevrtex
ClevrTex: A Texture-Rich Benchmark for Unsupervised Multi-Object Segmentation
Gem5Vision/gem5-resources-website
Website for gem5 vision
rigved-sanku/3D-Scene-Reconstruction-via-Structure-from-Motion-SfM
Led the development of a Structure from Motion (SfM) pipeline for 3D scene reconstruction from 2D images at Unity Hall, employing SIFT with RANSAC for robust feature detection and optimizing Essential Matrix calculation via SVD.
rigved-sanku/NeRF-Implementation-for-Efficient-3D-Scene-Reconstruction
Led the development of an optimized Neural Radiance Fields (NeRF) implementation for 3D scene reconstruction, focusing on synthesizing novel views from sparse 2D images using advanced deep learning techniques.