Pinned Repositories
detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Grounding-DINO-1.5-API
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
pytorch-distributed-training
Simple tutorials on Pytorch DDP training
pytorch-pooling
Test different pooling method used in CNN for Computer Vision Task
TRAR-VQA
[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
visualization
a collection of visualization function
ViT.pytorch
The Pytorch reimplementation of Vision Transformer
rentainhe's Repositories
rentainhe/pytorch-pooling
Test different pooling method used in CNN for Computer Vision Task
rentainhe/config-builder
a list of config-builder repo and tutorials which may help you to build your own config file
rentainhe/x-classification
a framework for image classification based on pytorch
rentainhe/pytorch-models
Computer vision models on Pytorch
rentainhe/knowledge-graph-backend
the backend of knowledge graph system based on Springboot
rentainhe/deep-learning-knowledge
A collection of cv-interview problems and answers
rentainhe/paper-reading
rentainhe/plug-and-play-blocks
Some APIs that can be used directly for deep learning
rentainhe/AutoTorch
AutoTorch, A HPO Toolkit
rentainhe/Components
some useful module based on Pytorch
rentainhe/Detection-From-Scratch
detection paper notes and code
rentainhe/faster-rcnn-pytorch
build faster rcnn on pytorch from scratch
rentainhe/grid-lxmert
PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".
rentainhe/interview-algorithm-collection
A collection of interview algorithm problems and solutions
rentainhe/ISTR
ISTR: End-to-End Instance Segmentation with Transformers (https://arxiv.org/abs/2105.00637)
rentainhe/learning-notes-sharing
Share my learning notes in this repo
rentainhe/mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
rentainhe/mini-vqa
A mini-platform for visual question answering research
rentainhe/nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
rentainhe/python-basic-knowledge
basic knowledge of python ( personal notes )
rentainhe/python-interviews
Python经典面试题(python面试的复习参考资料)(持续更新中...)
rentainhe/python_interview_question
关于python的面试题
rentainhe/pytorch-extending
rentainhe/simple-faster-rcnn-pytorch
A simplified implemention of Faster R-CNN that replicate performance from origin paper
rentainhe/Tasks_for_Rookies
视觉实验室新手任务
rentainhe/vision-transformer-pytorch
Pytorch version of Vision Transformer (ViT) with pretrained models. This is part of CASL (https://casl-project.github.io/) and ASYML project.