unolop's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default & custom datasets for applications such as summarization and Q&A, plus a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Meta Llama for WhatsApp & Messenger.
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
sashabaranov/go-openai
OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL), the chat & pretrained large vision-language model proposed by Alibaba Cloud.
InternLM/xtuner
An efficient, flexible, and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
LLaVA-VL/LLaVA-NeXT
jbmouret/matplotlib_for_papers
Handout for the tutorial "Creating publication-quality figures with matplotlib"
CSAILVision/places365
The Places365-CNNs for Scene Classification
rgeirhos/texture-vs-shape
Pre-trained models, data, code & materials from the paper "ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness" (ICLR 2019 Oral)
AILab-CVC/SEED
Official implementation of SEED-LLaMA (ICLR 2024).
SHI-Labs/VCoder
VCoder: Versatile Vision Encoders for Multimodal Large Language Models, arXiv 2023 / CVPR 2024
yunqing-me/AttackVLM
[NeurIPS 2023] Adversarial attacks on large vision-language models.
PhoenixZ810/MG-LLaVA
Official repository for the paper "MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning" (https://arxiv.org/abs/2406.17770).
zhoubolei/places_devkit
Development kit for the data of the Places365-Standard and Places365-Challenge
chs20/RobustVLM
[ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
LabForComputationalVision/pyrtools
Image pyramid code in Python 3
infly-ai/INF-MLLM
yuhui-zh15/VLMClassifier
Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)
heliossun/SQ-LLaVA
Visual self-questioning for large vision-language assistants.
IemProg/CoFiMA
🔥🔥 [ECCV 2024 Oral] Official code for "Weighted Ensemble Models Are Strong Continual Learners"
HaohanWang/PAR_experiments
Learning Robust Global Representations by Penalizing Local Predictive Power (NeurIPS 2019)
ajaysub110/critical-band-masking
Code for the NeurIPS 2023 paper "Spatial-frequency channels, shape bias, and adversarial robustness"
paulgavrikov/biases_vs_generalization
Official code for the CVPR 2024 paper "Can Biases in ImageNet Models Explain Generalization?"
BasicCoder/SketchClassification
PyTorch sketch classification
PKU-RL/COPL
Visual Grounding for Object-Level Generalization in Reinforcement Learning (ECCV 2024)
kailasdayanandan/dual_thinking