wulele2

wulele2's Stars

IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook15.1k 114 3881.4k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python14k 120 1.1k1.3k
NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Language:C++10.7k 157 3.8k2.1k
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language:Jupyter Notebook5.2k 62 390337
daquexian/onnx-simplifier
Simplify your onnx model
Language:C++3.8k 51 306383
OpenGVLab/InternImage
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Language:Python2.5k 34 264233
mit-han-lab/efficientvit
Efficient vision foundation models for high-resolution generation and perception.
Language:Python2.2k 38 135182
isl-org/DPT
Dense Prediction Transformers
Language:Python2k 43 82258
autodistill/autodistill
Images to inference with no labeling (use foundation models to train supervised models).
Language:Python1.9k 21 101155
WXinlong/SOLO
SOLO and SOLOv2 for instance segmentation, ECCV 2020 & NeurIPS 2020.
Language:Python1.7k 32 239306
ViTAE-Transformer/ViTPose
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
Language:Python1.4k 20 140186
ZhangGe6/onnx-modifier
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
Language:JavaScript1.3k 12 103165
microsoft/SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
Language:Python924 23 4287
xieenze/PolarMask
Code for 'PolarMask: Single Shot Instance Segmentation with Polar Representation'
Language:Python874 36 74158
qqlu/Entity
EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation
Language:Jupyter Notebook696 22 4358
lxtGH/Awesome-Segmentation-With-Transformer
[T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey
687 10 547
NVIDIA-AI-IOT/nanosam
A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT
Language:Python649 9 3058
IDEA-Research/DN-DETR
[CVPR 2022 Oral] Official implementation of DN-DETR
Language:Python543 16 6862
Atten4Vis/ConditionalDETR
This repository is an official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence". (https://arxiv.org/abs/2108.06152)
Language:Python366 8 3350
Jeff-sjtu/CrowdPose
CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark, CVPR 2019, Oral
Language:Python310 9 2338
deepglint/unicom
MLCD & UNICOM : Large-Scale Visual Representation Model
Language:Python276 7 2118
impiga/Plain-DETR
[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design
Language:Python192 14 254
IDEA-Research/ED-Pose
[ICLR 2023] Official implementation of the paper "Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation "
Language:Python153 3 2910
OliverRensu/TinyMIM
Language:Python150 2 117
amazon-science/polygon-transformer
Language:Python131 11 289
murdockhou/Single-Stage-Multi-person-Pose-Machines
A tensorlfow implementation about arxiv paper "Single-Stage Multi-Person Pose Machines" (SPM)
Language:Python129 7 1318
AAboys/MobileFormer
Code and models for mobile-former
Language:Python119 5 1918
jaehyunnn/ViTPose_pytorch
An unofficial implementation of ViTPose [Y. Xu et al., 2022]
Language:Jupyter Notebook103 1 1821
YuYue525/MobileSAM-pytorch
Reproduction of MobileSAM using pytorch
Language:Python94 1 1916
cherubicXN/logocap
Offical code of LOGO-CAP (CVPR' 22). https://arxiv.org/abs/2109.03622
Language:Python33 4 57