Chrisleowoo's Stars
apple/ml-ferret
mbzuai-oryx/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
wenxi-yue/SurgicalSAM
[AAAI2024] Official implementation of SurgicalSAM
med-air/EndoNeRF
Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery
GanjinZero/RAMM
Code and pre-trained models for RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training [ACM MM 2023]
YiyangZhou/POVID
[arXiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning
HongLiuuuuu/WSI-SAM
WSI-SAM
philip-mueller/lovt
Localized representation learning from Vision and Text (LoVT)
qubvel-org/segmentation_models.pytorch
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
QtacierP/PRIOR
Official repository for the paper "Prototype Representation Joint Learning from Medical Images and Reports, ICCV 2023".
yuan-12138/MI-SegNet
lich0031/AIDE
AIDE: Annotation-efficient deep learning for automatic medical image segmentation
MrGiovanni/Dissertation
Zongwei Zhou's Ph.D. Dissertation
tianrun-chen/SAM-Adapter-PyTorch
Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
taokz/BiomedGPT
BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks
Yunkun-Zhang/Data-Centric-FM-Healthcare
A survey on data-centric foundation models in healthcare.
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
CAMMA-public/SSG-VQA
SSG-VQA is a Visual Question Answering (VQA) dataset on laparoscopic videos providing diverse, geometrically grounded, unbiased and surgical action-oriented queries generated using scene graphs.
CAMMA-public/Endoscapes
Official Repository for the Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment
sair-lab/cse473s23
Slides for CSE573 Intro to Computer Vision & Image Processing
hammoudhasan/SynthCLIP
Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.
xulabs/aitom
AI for tomography
cfzd/Ultra-Fast-Lane-Detection
Ultra Fast Structure-aware Deep Lane Detection (ECCV 2020)
longpeng2008/yousan.ai
Awesome resources of yousan.ai (closely related to deep learning).
Minqi824/Overlap
Official code and data repository of "Anomaly Detection with Score Distribution Discrimination", KDD '23.