zkysss11235

Zhejiang university

Pinned Repositories

cloud
CSCI2470 Deep Learning Spring 2024: Enhancing Out-of-Distribution Object Detection with CLIP: A Vision-Language Approach
Language:Python2 2 00
Endo-FM
[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train
Language:Python165 3 2716
Gesture-Nauts
Language:Jupyter Notebook0 1 01
M2I2
This repository is made for the paper: Self-supervised vision-language pretraining for Medical visual question answering
Language:Python34 4 174
MMed-RAG
[arXiv'24 & NeurIPSW'24] MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models
Language:Python75 5 66
RULE
[EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
Language:Python54 1 53
MedTrinity-25M
This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine“
Language:Python233 2 1717
POVID
[Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning
Language:Python77 3 143

zkysss11235/MagiskOnWSA
Integrate Magisk root and Google Apps (OpenGApps) into WSA (Windows Subsystem for Android)