ai-forever/MERA
MERA (Multimodal Evaluation for Russian-language Architectures) is a new open Russian-language benchmark for evaluating foundation models.
Jupyter Notebook, MIT
Issues
- 12
no targets in rummlu and others benchmarks
#23 opened by thehir0 - 2
How do I add prompt formatting?
#4 opened by dmitrymailk - 0
Effect of the prompt on benchmark results
#20 opened by vlsav - 9
0.4.0 lm-evaluation-harness
#15 opened by germanjke - 2
Cannot log in on the mera.a-ai.ru site
#17 opened by preduct0r - 1
Scoring GGUF models
#16 opened by konductor000 - 1
Benchmark log values
#14 opened by thehumit - 1
empty value rummlu
#3 opened by mizinovmv - 1
tokenizer does not have a padding token
#9 opened by razikov - 2
Error when submitting to mera.a-ai.ru
#7 opened by GorbetskiyDmitriy - 1