Pinned Repositories
ProML
Code for "Semi-supervised Domain Adaptation via Prototype-based Multi-level Learning"
MediCLIP
Official implementation of "MediCLIP: Adapting CLIP for Few-shot Medical Image Anomaly Detection (MICCAI 2024 Early Accept)"
InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
minimind-v
🚀 [Large model] Train a 26M-parameter visual multimodal VLM from scratch in just 1 hour! 🌏
Basic-Visual-Language-Model
Build a simple, basic multimodal large model from scratch. 🤖
minimind-v
[Large model] Train a 27M-parameter visual multimodal VLM from scratch in 3 hours; both inference and training run on a personal GPU!
XDU_ML_answer
Answers for the machine learning course taught by Qi Fei at XDU (2019 cohort)
xinyanghuang7
Config files for my GitHub profile.
Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
OVMR
OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)
xinyanghuang7's Repositories
xinyanghuang7/Basic-Visual-Language-Model
Build a simple, basic multimodal large model from scratch. 🤖
xinyanghuang7/XDU_ML_answer
Answers for the machine learning course taught by Qi Fei at XDU (2019 cohort)
xinyanghuang7/xinyanghuang7
Config files for my GitHub profile.
xinyanghuang7/minimind-v
[Large model] Train a 27M-parameter visual multimodal VLM from scratch in 3 hours; both inference and training run on a personal GPU!