/LLM_Evaluation

Primary LanguageJupyter Notebook

LLM_Evaluation

mmlu_model_eval_multi_choice.ipynb: Evaluate model with multiple choice style. mmlu_model_eval_cloze_prompt.ipynb: Evaluate model with probabilities of sentences.