Pinned Repositories
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
Woodpecker
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
llm-transparency-tool
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/llm-transparency-tool-demo
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
ACGNet
An unofficial Pytorch implementation of ACGNet
AUW-GCN
[ICME-2023] Official Pytorch implementation of AU-aware graph convolutional network for Macro- and Micro-expression spotting
BGM-Net
[ACM TOMM-2024] Official Pytorch implementation of "Exploiting Instance-level Relationships in Weakly Supervised Text-to-Video Retrieval"
xjtupanda's Repositories
xjtupanda/AUW-GCN
[ICME-2023] Official Pytorch implementation of AU-aware graph convolutional network for Macro- and Micro-expression spotting
xjtupanda/ACGNet
An unofficial Pytorch implementation of ACGNet
xjtupanda/BGM-Net
[ACM TOMM-2024] Official Pytorch implementation of "Exploiting Instance-level Relationships in Weakly Supervised Text-to-Video Retrieval"