Pinned Repositories
screen_qa
The ScreenQA dataset was introduced in the paper "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots". It contains ~86K question-answer pairs collected by human annotators for ~35K screenshots from the Rico dataset. It is intended for training and evaluating models capable of screen content understanding via question answering.
Qwen2.5-VL
Qwen2.5-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
CogVLM2
A GPT-4V-level open-source multimodal model based on Llama3-8B.
WildVision-Bench
GildedRose-Refactoring-Kata
Starting code for the GildedRose Refactoring Kata in many programming languages.
ZhangLin009's Repositories
ZhangLin009/GildedRose-Refactoring-Kata
Starting code for the GildedRose Refactoring Kata in many programming languages.