junyangwang0410
Studying at Beijing Jiaotong University Research intern at Intelligent Computing of Alibaba Group
Beijing Jiaotong UniversityBeijing, China
Pinned Repositories
CoAT
Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)
AMBER
An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation
Attention-LLaVA
A hot-pluggable tool for visualizing LLaVA's attention.
HaELM
An automatic MLLM hallucination detection framework
junyangwang0410.github.io
Knight
SotA text-only image/video method (IJCAI 2023)
MobileAgent
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
video-demo
MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
mPLUG-HalOwl
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating
junyangwang0410's Repositories
junyangwang0410/AMBER
An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation
junyangwang0410/HaELM
An automatic MLLM hallucination detection framework
junyangwang0410/Knight
SotA text-only image/video method (IJCAI 2023)
junyangwang0410/Attention-LLaVA
A hot-pluggable tool for visualizing LLaVA's attention.
junyangwang0410/MobileAgent
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
junyangwang0410/junyangwang0410.github.io
junyangwang0410/video-demo