lll2343

Pinned Repositories

fluid
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)
Language: Go 1.7k 32 1.1k 961
AVSegFormer
[AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer
Language: Python 0 0 0 0
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
0 0 0 0
detr
End-to-End Object Detection with Transformers
Language: Python 0 0 0 0
mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
Language: Python 0 0 0 0
MYSCU_01
Test for 云上川大 (Sichuan University on the Cloud)
Language: JavaScript 4 1 0 0
MMFuser
The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". MMFuser addresses the limitations of current MLLMs in capturing complex image details by simply yet efficiently integrating multi-layer features from ViTs.
Language: Python 33 1 0 4
MMInstruct
The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity". The MMInstruct dataset includes 973K instructions from 24 domains and four instruction types.
Language: Python 33 4 1 2

lll2343's Repositories

lll2343/AVSegFormer
[AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer
Language: Python 0 0 0 0
lll2343/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
0 0 0 0
lll2343/detr
End-to-End Object Detection with Transformers
Language: Python 0 0 0 0
lll2343/mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
Language: Python 0 0 0 0