Pinned Repositories
QiZhenGPT
QiZhenGPT: An Open Source Chinese Medical Large Language Model|一个开源的中文医疗大语言模型
Megatron-LLM
distributed trainer for LLMs
CG-Eval
Chinese Generation Evaluation
factool
FacTool: Factuality Detection in Generative AI
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
ArXivQA
WIP - Automated Question Answering for ArXiv Papers with Large Language Models (https://arxiv.taesiri.xyz/)
AlignBench
多维度中文对齐评测基准 | Benchmarking Chinese Alignment of LLMs
FLAIR
RefGPT
wwngh1233's Repositories
wwngh1233/AlignBench
多维度中文对齐评测基准 | Benchmarking Chinese Alignment of LLMs
wwngh1233/FLAIR
wwngh1233/RefGPT