Pinned Repositories
arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
lm-sys.github.io
vicuna-blog-eval
The code and data for the GPT-4 based benchmark in the vicuna blog post
LMSYS's Repositories
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
lm-sys/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
lm-sys/llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
lm-sys/lm-sys.github.io
lm-sys/vicuna-blog-eval
The code and data for the GPT-4 based benchmark in the vicuna blog post