prometheus-eval
Codebase for running inference with, and training, foundation models specialized in evaluating other foundation models
United States of America
Pinned Repositories
.github
Organization README for prometheus-eval
leaderboard
BiGGen-Bench Leaderboard
prometheus
[ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus is a good alternative to human evaluation and GPT-4 evaluation.
prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
prometheus-eval.github.io
Documentation and blog posts for Prometheus
prometheus-vision
[ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus-Vision is a good alternative to human evaluation and GPT-4V evaluation.
prometheus-eval's Repositories
prometheus-eval/prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
prometheus-eval/prometheus
[ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus is a good alternative to human evaluation and GPT-4 evaluation.
prometheus-eval/prometheus-vision
[ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus-Vision is a good alternative to human evaluation and GPT-4V evaluation.
prometheus-eval/.github
Organization README for prometheus-eval
prometheus-eval/leaderboard
BiGGen-Bench Leaderboard
prometheus-eval/prometheus-eval.github.io
Documentation and blog posts for Prometheus