WasedaMagina's Stars
icoz69/StableLLAVA
Official repo for StableLLAVA
luogen1996/LaVIN
[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"
janghyuncho/DECOLA
Code release for "Language-conditioned Detection Transformer"
jianghaojun/Awesome-Parameter-Efficient-Transfer-Learning
A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
jingyi0000/VLM_survey
Collection of AWESOME vision-language models for vision tasks
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
YiyangZhou/LURE
[ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
BradyFU/Woodpecker
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
j-min/DSG
Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
mlzxy/devit
CoRL 2024
RunpeiDong/DreamLLM
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
facebookresearch/contriever
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
meta-llama/llama
Inference code for Llama models
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
wl-zhao/VPD
[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to downstream visual perception tasks.
wbbeyourself/SCM4LLMs
Self-Controlled Memory System for LLMs
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
jzbjyb/ReAtt
Retrieval as Attention
ShuyangCao/hibrids_summ
Code for ACL 2022 paper "HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization".
abertsch72/unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
open-mmlab/playground
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
fudan-zvg/SeaFormer
[ICLR 2023] SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation
hustvl/TopFormer
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022
LightDXY/FT-CLIP
CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet