Chrisleowoo

Chrisleowoo's Stars

apple/ml-ferret
Language:Python8.3k484
mbzuai-oryx/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Language:Python73837
wenxi-yue/SurgicalSAM
[AAAI2024] Official implementation of SurgicalSAM
Language:Python689
med-air/EndoNeRF
Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery
Language:Python18317
GanjinZero/RAMM
Codes and Pre-trained models for RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training [ACM MM 2023]
Language:Python231
YiyangZhou/POVID
[Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning
Language:Python601
HongLiuuuuu/WSI-SAM
WSI-SAM
7
philip-mueller/lovt
Localized representation learning from Vision and Text (LoVT)
Language:Python264
qubvel-org/segmentation_models.pytorch
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
Language:Python9.3k1.6k
QtacierP/PRIOR
Official repository for the paper "Prototype Representation Joint Learning from Medical Images and Reports, ICCV 2023".
Language:Python605
yuan-12138/MI-SegNet
Language:Python105
lich0031/AIDE
AIDE: Annotation-efficient deep learning for automatic medical image segmentation
Language:Python5116
MrGiovanni/Dissertation
Zongwei Zhou's Ph.D. Dissertation
Language:TeX132
tianrun-chen/SAM-Adapter-PyTorch
Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts
Language:Python96483
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Language:Python2.4k248
taokz/BiomedGPT
BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks
Language:Python44050
Yunkun-Zhang/Data-Centric-FM-Healthcare
A survey on data-centric foundation models in healthcare.
663
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
Language:Python1.5k193
CAMMA-public/SSG-VQA
SSG-VQA is a Visual Question Answering (VQA) dataset on laparoscopic videos providing diverse, geometrically grounded, unbiased and surgical action-oriented queries generated using scene graphs.
Language:Python281
CAMMA-public/Endoscapes
Official Repository for the Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment
291
sair-lab/cse473s23
Slides for CSE573 Intro to Computer Vision & Image Processing
71
hammoudhasan/SynthCLIP
Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.
Language:Python841
xulabs/aitom
AI for tomography
Language:Python13176
cfzd/Ultra-Fast-Lane-Detection
Ultra Fast Structure-aware Deep Lane Detection (ECCV 2020)
Language:Python1.8k492
longpeng2008/yousan.ai
Awesome resources of yousan.ai(closely related to deep learning).
Language:Python1.4k519
Minqi824/Overlap
Official code and data repository of "Anomaly Detection with Score Distribution Discrimination", KDD' 23.
Language:Python202