Pinned Repositories
CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
DAB-DETR
[ICLR 2022] DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
DAHOI
DAHOI:Dynamic Anchor for Human-Object Interaction Detection
FGAHOI
GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
open_flamingo
An open-source framework for training large multimodal models.
ov-seg
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
OW-DETR
[CVPR 2022] Official Pytorch code for OW-DETR: Open-world Detection Transformer
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
SKDF
xiaomabufei's Repositories
xiaomabufei/FGAHOI
xiaomabufei/SKDF
xiaomabufei/DAHOI
DAHOI:Dynamic Anchor for Human-Object Interaction Detection
xiaomabufei/GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
xiaomabufei/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
xiaomabufei/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
xiaomabufei/DAB-DETR
[ICLR 2022] DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
xiaomabufei/open_flamingo
An open-source framework for training large multimodal models.
xiaomabufei/ov-seg
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
xiaomabufei/OW-DETR
[CVPR 2022] Official Pytorch code for OW-DETR: Open-world Detection Transformer
xiaomabufei/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
xiaomabufei/deep-learning-for-image-processing
deep learning for image processing including classification and object-detection etc.
xiaomabufei/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
xiaomabufei/mlc-chatbot
python interface for mlc chat cli
xiaomabufei/QAHOI
xiaomabufei/RITR
RITR Efficient One-Stage Detection of Human-Object Interaction with Dilation Transformer
xiaomabufei/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.