Pinned Repositories
friend-group-2020
rse-classwork-2020
Travel-guide
The best travel guide ever
SoM
Set-of-Mark Prompting for GPT-4V and LMMs
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
SafetyBench
Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]
Alexyuanfun's Repositories
Alexyuanfun/friend-group-2020
Alexyuanfun/rse-classwork-2020
Alexyuanfun/Travel-guide
The best travel guide ever