justdolearning's Stars
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
ultralytics/ultralytics
Ultralytics YOLO11 🚀
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
Mikubill/sd-webui-controlnet
WebUI extension for ControlNet
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
aleju/imgaug
Image augmentation for machine learning experiments.
PaddlePaddle/PaddleDetection
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
google-research/vision_transformer
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
facebookresearch/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
OpenGVLab/DragGAN
Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux)
open-mmlab/mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
sail-sg/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
alibaba/EasyCV
An all-in-one toolkit for computer vision
intelligent-machine-learning/dlrover
DLRover: An Automatic Distributed Deep Learning System
Fantasy-Studio/Paint-by-Example
Paint by Example: Exemplar-based Image Editing with Diffusion Models
megvii-research/mdistiller
The official implementation of [CVPR2022] Decoupled Knowledge Distillation https://arxiv.org/abs/2203.08679 and [ICCV2023] DOT: A Distillation-Oriented Trainer https://openaccess.thecvf.com/content/ICCV2023/papers/Zhao_DOT_A_Distillation-Oriented_Trainer_ICCV_2023_paper.pdf
baudm/parseq
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
LiheYoung/UniMatch
[CVPR 2023] Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation
Tengfei-Wang/HFGI
CVPR 2022 HFGI: High-Fidelity GAN Inversion for Image Attribute Editing
FudanVI/FudanOCR
A toolbox of scene text super-resolution and recognition
mlzxy/devit
CoRL 2024
avanetten/yolt
You Only Look Twice: Rapid Multi-Scale Object Detection In Satellite Imagery
OpenGVLab/UniHCP
Official PyTorch implementation of UniHCP
grip-unina/TruFor
TruFor
HVision-NKU/CamoFormer
HighwayWu/FOCAL
Rethinking Image Forgery Detection and Localization