poopoobooy

poopoobooy's Stars

haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python18.3k2k
state-spaces/mamba
Mamba SSM architecture
Language:Python11.8k984
Blokkendoos/AACircuit
Pythonized AACircuit: Draw electronic circuits with ASCII characters.
Language:Python1457
mbzuai-oryx/Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
Language:Python1.1k93
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Language:Python3.5k241
Jingkang50/OpenOOD
Benchmarking Generalized Out-of-Distribution Detection
Language:Python79999