Pinned Repositories
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles: Latest papers and datasets on multimodal large language models, and their evaluation.
bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
Monkey
Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
segment-anything
This repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks showing how to use the model.
Transformer-Explainability
[CVPR 2021] Official PyTorch implementation of "Transformer Interpretability Beyond Attention Visualization", a novel method for visualizing classifications made by Transformer-based networks.
VL-InterpreT
Visual Language Transformer Interpreter - An interactive visualization tool for interpreting vision-language transformers
VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4V, Gemini, QwenVLPlus, 30+ Hugging Face models, and 15+ benchmarks.
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA), built towards GPT-4V-level capabilities and beyond.
bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
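To illustrate the k-bit quantization idea that bitsandbytes builds on, here is a minimal NumPy sketch of absmax int8 quantization: scale the weights into the int8 range, store the scale factor, and dequantize when the weights are needed. This is a conceptual sketch only, not bitsandbytes' actual implementation (which operates on PyTorch tensors with block-wise scales and fused CUDA kernels); the function names are hypothetical.

```python
# Illustrative sketch of absmax int8 quantization (hypothetical helpers,
# not the bitsandbytes API): weights are scaled so the largest magnitude
# maps to 127, stored as int8, and dequantized with the saved scale.
import numpy as np

def quantize_absmax_int8(w: np.ndarray):
    """Quantize a float array to int8 using the absolute-maximum scale."""
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover an approximation of the original float weights."""
    return q.astype(np.float32) * scale

w = np.array([0.6, -1.0, 0.25, 0.0], dtype=np.float32)
q, scale = quantize_absmax_int8(w)
w_hat = dequantize(q, scale)
# The int8 copy uses 4x less memory than float32, and the
# reconstruction error stays below one quantization step (= scale).
assert q.dtype == np.int8
assert np.max(np.abs(w - w_hat)) < scale
```

Real k-bit schemes refine this with block-wise scales (one scale per small block of weights) so a single outlier does not blow up the error for the whole tensor.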