wulele2's Stars
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
daquexian/onnx-simplifier
Simplify your onnx model
OpenGVLab/InternImage
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
mit-han-lab/efficientvit
Efficient vision foundation models for high-resolution generation and perception.
isl-org/DPT
Dense Prediction Transformers
autodistill/autodistill
Images to inference with no labeling (use foundation models to train supervised models).
WXinlong/SOLO
SOLO and SOLOv2 for instance segmentation, ECCV 2020 & NeurIPS 2020.
ViTAE-Transformer/ViTPose
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
ZhangGe6/onnx-modifier
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
microsoft/SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
xieenze/PolarMask
Code for 'PolarMask: Single Shot Instance Segmentation with Polar Representation'
qqlu/Entity
EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation
lxtGH/Awesome-Segmentation-With-Transformer
[T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey
NVIDIA-AI-IOT/nanosam
A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT
IDEA-Research/DN-DETR
[CVPR 2022 Oral] Official implementation of DN-DETR
Atten4Vis/ConditionalDETR
This repository is an official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence". (https://arxiv.org/abs/2108.06152)
Jeff-sjtu/CrowdPose
CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark, CVPR 2019, Oral
deepglint/unicom
MLCD & UNICOM : Large-Scale Visual Representation Model
impiga/Plain-DETR
[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design
IDEA-Research/ED-Pose
[ICLR 2023] Official implementation of the paper "Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation"
OliverRensu/TinyMIM
amazon-science/polygon-transformer
murdockhou/Single-Stage-Multi-person-Pose-Machines
A TensorFlow implementation of the arXiv paper "Single-Stage Multi-Person Pose Machines" (SPM)
AAboys/MobileFormer
Code and models for mobile-former
jaehyunnn/ViTPose_pytorch
An unofficial implementation of ViTPose [Y. Xu et al., 2022]
YuYue525/MobileSAM-pytorch
Reproduction of MobileSAM using pytorch
cherubicXN/logocap
Official code of LOGO-CAP (CVPR'22). https://arxiv.org/abs/2109.03622