Pinned Repositories
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
tsne-cuda
GPU Accelerated t-SNE for CUDA with Python bindings
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
google-research
Google Research
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
ovsam
[ECCV 2024] The official code of paper "Open-Vocabulary SAM".
detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
VMamba
VMamba: Visual State Space Models,code is based on mamba
CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
nhw649's Repositories
nhw649 doesn’t have any repository yet.