ZhangLin009

Pinned Repositories

screen_qa
ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K question-answer pairs collected by human annotators for ~35K screenshots from Rico. It should be used to train and evaluate models capable of screen content understanding via question answering.
Language:Python107 6 37
Qwen2.5-VL
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Jupyter Notebook8.9k 56 781628
CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
Language:Python2.3k 29 183152
WildVision-Bench
Language:Python15 2 23
GildedRose-Refactoring-Kata
Starting code for the GildedRose Refactoring Kata in many programming languages.
Language:Python0 0 00

ZhangLin009/GildedRose-Refactoring-Kata
Starting code for the GildedRose Refactoring Kata in many programming languages.
Language:Python0 0 00