msingh27's Stars
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
lllyasviel/ControlNet
Let us control diffusion models!
facebookresearch/detr
End-to-End Object Detection with Transformers
google-deepmind/deepmind-research
This repository contains implementations and illustrative code to accompany DeepMind publications
facebookresearch/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
dmlc/gluon-cv
Gluon CV Toolkit
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
AiuniAI/Unique3D
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
xinghaochen/awesome-hand-pose-estimation
Awesome work on hand pose estimation/tracking
rail-berkeley/rlkit
Collection of reinforcement learning algorithms
gordicaleksa/pytorch-GAT
My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entropy histograms. I've supported both Cora (transductive) and PPI (inductive) examples!
adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
autonomousvision/occupancy_networks
This repository contains the code for the paper "Occupancy Networks - Learning 3D Reconstruction in Function Space"
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
lucidrains/perceiver-pytorch
Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
CLAY-3D/OpenCLAY
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
Epiphqny/VisTR
[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers
bertjiazheng/awesome-scene-understanding
😎 A list of awesome scene understanding papers.
facebookresearch/3detr
Code & Models for 3DETR - an End-to-end transformer model for 3D object detection
pengHTYX/Era3D
ptrblck/pytorch_misc
Code snippets created for the PyTorch discussion board
timmeinhardt/trackformer
Implementation of "TrackFormer: Multi-Object Tracking with Transformers". [Conference on Computer Vision and Pattern Recognition (CVPR), 2022]
daerduoCarey/partnet_dataset
PartNet Dataset Official Release Repo
HaozhiQi/RPIN
Learning Long-term Visual Dynamics with Region Proposal Interaction Networks (ICLR 2021)
nickgkan/butd_detr
Code for the ECCV22 paper "Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds"
google-research/robodesk
CVMI-Lab/DODA
(ECCV 2022) DODA: Data-oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation
rohitrango/Adversarial-Pose-Estimation
A PyTorch implementation of the paper 'Adversarial PoseNet: A Structure-aware Convolutional Network for Human Pose Estimation' (https://arxiv.org/pdf/1705.00389v2.pdf)
manuelli/key_dynam
Code for paper "Keypoints into the Future: Self-Supervised Correspondence in Model-Based RL"
nupurkmr9/Attributional-Robustness