WasedaMagina

WasedaMagina's Stars

icoz69/StableLLAVA
Official repo for StableLLAVA
Language:Python9310
luogen1996/LaVIN
[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"
Language:Python51338
janghyuncho/DECOLA
Code release for "Language-conditioned Detection Transformer"
Language:Python854
jianghaojun/Awesome-Parameter-Efficient-Transfer-Learning
A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.
39325
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language:Python16.8k1.7k
jingyi0000/VLM_survey
Collection of AWESOME vision-language models for vision tasks
2.7k227
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Language:Python2.4k455
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language:Python8.1k591
YiyangZhou/LURE
[ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
Language:Python1395
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python37.4k4.6k
BradyFU/Woodpecker
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
Language:Python62230
j-min/DSG
Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)
Language:Jupyter Notebook795
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.7k6.4k
mlzxy/devit
CoRL 2024
Language:Python36046
RunpeiDong/DreamLLM
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
Language:Python4057
facebookresearch/contriever
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
Language:Python70260
meta-llama/llama
Inference code for Llama models
Language:Python57k9.6k
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
7k414
dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Language:Python2k130
wl-zhao/VPD
[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to downstream visual perception tasks.
Language:Jupyter Notebook51831
wbbeyourself/SCM4LLMs
Self-Controlled Memory System for LLMs
Language:Jupyter Notebook4211
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Language:Python18.6k1.9k
jzbjyb/ReAtt
Retrieval as Attention
Language:Python834
ShuyangCao/hibrids_summ
Code for ACL 2022 paper "HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization".
Language:Python12
abertsch72/unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
Language:Python1.1k81
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language:Python25.5k2.9k
open-mmlab/playground
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
Language:Python1.1k122
fudan-zvg/SeaFormer
[ICLR 2023] SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation
Language:Python30123
hustvl/TopFormer
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022
Language:Python38742
LightDXY/FT-CLIP
CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet
Language:Python2118