Re-run Mixtral Evals w/ official Mixtral-Instruct
Closed this issue · 0 comments
neubig commented
We used a third-party model instead of the official Mixtral model in our original evaluation: https://twitter.com/arthurmensch/status/1737138144854606314
We should:
- Re-run the Mixtral evals for each task with the official Mixtral Instruct model
- Update the numbers, figures, and discussion in each task section of the paper
- Update the Zeno report to match the paper content
Here is a checklist for the tasks. Please check off each task once it is done!
- Knowledge-based QA
- Reasoning
- Mathematics
- Code Generation
- Translation
- Web Instruction Following