Pinned Repositories
cloud
CSCI2470 Deep Learning Spring 2024: Enhancing Out-of-Distribution Object Detection with CLIP: A Vision-Language Approach
Endo-FM
[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train
Gesture-Nauts
M2I2
This repository is made for the paper: Self-supervised vision-language pretraining for Medical visual question answering
MMed-RAG
[arXiv'24 & NeurIPSW'24] MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models
RULE
[EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
MedTrinity-25M
This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine“
POVID
[Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning
zkysss11235's Repositories
zkysss11235/MagiskOnWSA
Integrate Magisk root and Google Apps (OpenGApps) into WSA (Windows Subsystem for Android)