jaychempan's Stars
ChangeZH/ALL2COCO
Converts the label formats of all object detection datasets to the COCO JSON label format.
ViTAE-Transformer/MTP
The official repo for [JSTARS'24] "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"
geoaigroup/awesome-vision-language-models-for-earth-observation
A curated list of awesome vision and language resources for earth observation.
voxel51/fiftyone
Refine high-quality datasets and visual AI models
openai/guided-diffusion
OpenGVLab/VideoMamba
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
SPDQ/Power-Plant-Detection-in-RSI
longzw1997/Open-GroundingDino
A third-party implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection".
DeminYu98/DiffCast
[CVPR 2024] Official implementation of "DiffCast: A Unified Framework via Residual Diffusion for Precipitation Nowcasting"
zoubohao/DenoisingDiffusionProbabilityModel-ddpm-
This may be the simplest implementation of DDPM. You can run Main.py directly to train the UNet on the CIFAR-10 dataset and see the amazing process of denoising.
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
SxJyJay/Lumen
[NeurIPS 2024] Lumen: a Large multimodal model with versatile vision-centric capabilities
yatengLG/ISAT_with_segment_anything
Labeling tool with SAM (Segment Anything Model); supports SAM, SAM2, sam-hq, MobileSAM, EdgeSAM, etc. An interactive semi-automatic image annotation tool.
zcablii/LSKNet
(IJCV2024 & ICCV2023) LSKNet: A Foundation Lightweight Backbone for Remote Sensing
YingWANGG/M2IB
Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution
xai-org/grok-1
Grok open release
YangYimin98/AA-TransUNet
The repository for the paper "AA-TransUNet: Attention Augmented TransUNet For Nowcasting Tasks".
jaychempan/PIR-CLIP
📖 Official Code for “PIR-CLIP: Remote Sensing Image-text Retrieval with Prior Instruction Representation Learning”
sail-sg/MDT
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
shonenkov/CLIP-ODS
CLIP Object Detection, search object on image using natural language #Zeroshot #Unsupervised #CLIP #ODS
NTUYWANG103/clip-image-search
This code implements a versatile image search engine leveraging the CLIP model and FAISS, capable of processing both text-to-image and image-to-image queries.
Yuezhengrong/Image-text-search-engine
Image text search engine based on CLIP
csguoh/MambaIR
[ECCV2024] MambaIR and MambaIRv2!
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
state-spaces/mamba
Mamba SSM architecture
SiatMMLab/Awesome-Diffusion-Model-Based-Image-Editing-Methods
Diffusion Model-Based Image Editing: A Survey (arXiv)
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
WangZhenqing-RS/GF_preprocess
Ocean-Intelligent-Forecasting/XiHe-GlobalOceanForecasting
tyui592/awesome-precipitation-nowcasting
A list of Precipitation Nowcasting papers and related resources.