rentainhe

Computer Vision Engineer at IDEA-CVR @IDEA-Research

IDEA, Shenzhen, China

Pinned Repositories

detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
Language: Python | 2k 25 166212
Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Language: Jupyter Notebook | 1.2k 10 51111
Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Language: Jupyter Notebook | 15.2k 114 3901.4k
Grounding-DINO-1.5-API
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Language: Python | 784 12 4522
GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language: Python | 6.8k 42 305690
pytorch-distributed-training
Simple tutorials on PyTorch DDP training
Language: Python | 267 4 147
pytorch-pooling
Tests different pooling methods used in CNNs for computer vision tasks
Language: Python | 35 2 25
TRAR-VQA
[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
Language: Python | 65 3 718
visualization
A collection of visualization functions
Language: Python | 387 2 841
ViT.pytorch
A PyTorch reimplementation of Vision Transformer
Language: Jupyter Notebook | 10 2 10

rentainhe's Repositories

rentainhe/Learn-Detectron2-From-Scratch
Detectron2 Learning Notes Sharing
9 5 10
rentainhe/knowledge-graph-visualization
Knowledge graph system based on Neo4j and Vue
Language: Vue | 6 2 00
rentainhe/rentainhe.github.io
Personal homepage
Language: HTML | 2 0 0
rentainhe/Awesome-Anything
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
1 0 0
rentainhe/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
Language: Python | 1 0 0
rentainhe/ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
Language: Go | 1 0 0
rentainhe/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
1
rentainhe/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.
Language: Jupyter Notebook | 0 0
rentainhe/dataset-api
The ApolloScape Open Dataset for Autonomous Driving and its Application.
Language: Jupyter Notebook | 0 0
rentainhe/detectron2
Detectron2 is FAIR's next-generation platform for object detection, segmentation and other visual recognition tasks.
Language: Python | 1 0
rentainhe/DiffusionDet
PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
Language: Python | 0 0
rentainhe/DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
Language: Python | 0 0
rentainhe/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language: Python | 0 0
rentainhe/EVA
Exploring the Limits of Masked Visual Representation Learning at Scale (https://arxiv.org/abs/2211.07636)
Language: Python | 0 0
rentainhe/GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language: Python | 0 0
rentainhe/InternImage
[CVPR 2023] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Language: Python | 0 0
rentainhe/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal dialogue model approaching GPT-4o performance.
Language: Python | 0 0
rentainhe/learn-ddim
Denoising Diffusion Implicit Models
Language: Python | 0 0
rentainhe/learned-guided-diffusion
Learning Guided Diffusion
Language: Python | 0 0
rentainhe/object-intrinsics
(CVPR 2023) Seeing a Rose in Five Thousand Ways
Language: Python | 0 0
rentainhe/OpenSeeD
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
Language: Python | 0 0
rentainhe/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Language: Python | 1 0
rentainhe/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language: Python | 0 0
rentainhe/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
Language: Shell | 0 0
rentainhe/rentainhe
2 03
rentainhe/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook | 0 0
rentainhe/Segment-Everything-Everywhere-All-At-Once
Official implementation of the paper "Segment Everything Everywhere All at Once"
Language: Python | 0 0
rentainhe/stable-diffusion
Language: Jupyter Notebook | 0 0
rentainhe/stable-diffusion-learned
Personal Learning Version
Language: Jupyter Notebook | 0 0
rentainhe/T-Rex
Detect and count any objects by visual prompting
Language: Python | 0 0