FuxiaoLiu
Hi! I'm a third-year CS Ph.D. student at the University of Maryland, College Park, working with Abhinav Shrivastava and Yaser Yacoob.
Pinned Repositories
awesome-Large-MultiModal-Hallucination
😎 An up-to-date, curated list of awesome LMM hallucination papers, methods & resources.
DocumentCLIP
[ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents
HallusionBench
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
Large-Multimodal-Hallucination
LRV-Instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
MMC
[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning
Twitter-Video-dataset
[EACL'23] COVID-VTS: Fact Extraction and Verification on Short Video Platforms
VisualNews-Repository
[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning
EAGLE
Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs
FuxiaoLiu's Repositories
FuxiaoLiu/LRV-Instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
FuxiaoLiu/MMC
[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning
FuxiaoLiu/VisualNews-Repository
[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning
FuxiaoLiu/DocumentCLIP
[ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents
FuxiaoLiu/Twitter-Video-dataset
[EACL'23] COVID-VTS: Fact Extraction and Verification on Short Video Platforms
FuxiaoLiu/HallusionBench
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
FuxiaoLiu/Large-Multimodal-Hallucination
FuxiaoLiu/awesome-Large-MultiModal-Hallucination
😎 An up-to-date, curated list of awesome LMM hallucination papers, methods & resources.
FuxiaoLiu/EAGLE
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
FuxiaoLiu/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
FuxiaoLiu/calvinliu123
FuxiaoLiu/calvinliu123.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
FuxiaoLiu/Classproject_VIL
FuxiaoLiu/CMSC722_project
FuxiaoLiu/fuxiaoliu.github.io
FuxiaoLiu/GoodNews
Good News Everyone! - CVPR 2019
FuxiaoLiu/LLaVA
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
FuxiaoLiu/LRV
FuxiaoLiu/M3Exam
Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"
FuxiaoLiu/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
FuxiaoLiu/mplug_implementation_fl
FuxiaoLiu/open_clip
An open source implementation of CLIP.
FuxiaoLiu/Recommendation-System
FuxiaoLiu/Role-Embedding
FuxiaoLiu/SAT
FuxiaoLiu/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
FuxiaoLiu/TCP
[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.
FuxiaoLiu/tool4ipp
This repository contains a data conversion tool for the Image Position Prediction task proposed in our paper.
FuxiaoLiu/Twitter-COMMs
FuxiaoLiu/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)