affromero's Stars
segments-ai/panoptic-segment-anything
Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
yoxu515/aot-benchmark
An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
AILab-CVC/SEED
Official implementation of SEED-LLaMA (ICLR 2024).
facebookresearch/co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
ChaoningZhang/MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
pixelite1201/BEDLAM
PerceivingSystems/bedlam_render
BEDLAM (CVPR 2023) render pipeline tools
hellock/icrawler
A multi-thread crawler framework with many builtin image crawlers provided.
voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
CASIA-IVA-Lab/FastSAM
Fast Segment Anything
SizheAn/PanoHead
Code Repository for CVPR 2023 Paper "PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360 degree"
TimDettmers/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
facebookresearch/localrf
An algorithm for reconstructing the radiance field of a large-scale scene from a single casually captured video.
ssundaram21/dreamsim
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight)
xiaobai1217/Awesome-Video-Datasets
Video datasets
TencentARC/Mix-of-Show
NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Corleone-Huang/DynamicVectorQuantization
BrandonHanx/HeadSculpt
[NeurIPS 2023] HeadSculpt: Crafting 3D Head Avatars with Text
shubham-goel/4D-Humans
4DHumans: Reconstructing and Tracking Humans with Transformers
mshahbazi72/NeRF-GAN-Distillation
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
SysCV/sam-hq
Segment Anything in High Quality [NeurIPS 2023]
zju3dv/AutoRecon
Code for "AutoRecon: Automated 3D Object Discovery and Reconstruction" CVPR 2023 (Highlight)
ZoneLikeWonderland/HACK-Model
threestudio-project/threestudio
A unified framework for 3D content generation.
yiqun-wang/PET-NeuS
PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces (CVPR 2023)
brentyi/tyro
Zero-effort CLI interfaces & config objects, from types