Pinned Repositories
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
HelloBench
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
quehry.github.io
quehry.github.io
Quehry's Repositories
Quehry/HelloBench
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Quehry/quehry.github.io
quehry.github.io
Quehry/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.