SAR-LLM

Code: https://github.com/LiqunMa/SAR-LLM

How to finetune LLMs with soft targets

Requirements

accelerate==0.29.3

datasets==2.19.0

lm_eval==0.4.2

tokenizers==0.19.1

torch==2.2.2

transformers==4.40.0

wandb==0.16.2

vllm==0.3.2
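
The versions above are pinned. One way to install them (assuming you copy the list into a requirements.txt):

pip install -r requirements.txt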

Download data

  1. Alpaca: https://github.com/tatsu-lab/stanford_alpaca/blob/main/alpaca_data.json. Please save it in finetuning_data/.
  2. RedPajama-Data-1T-Sample: https://huggingface.co/datasets/togethercomputer/RedPajama-Data-1T-Sample. Please save it in finetuning_data/Redpajama-Sample. Both downloads can also be scripted, as in the sketch below.
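
A minimal download sketch using urllib and huggingface_hub; the alpaca_data.json file name inside finetuning_data/ is an assumption based on the paths above:

import os
import urllib.request
from huggingface_hub import snapshot_download

os.makedirs("finetuning_data", exist_ok=True)

# 1. Alpaca: a single JSON file from the stanford_alpaca repo
urllib.request.urlretrieve(
    "https://raw.githubusercontent.com/tatsu-lab/stanford_alpaca/main/alpaca_data.json",
    "finetuning_data/alpaca_data.json",
)

# 2. RedPajama-Data-1T-Sample: mirror the dataset repo locally
snapshot_download(
    repo_id="togethercomputer/RedPajama-Data-1T-Sample",
    repo_type="dataset",
    local_dir="finetuning_data/Redpajama-Sample",
)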

Preprocess data

To get the tokenized data for calculating the n-gram distribution:

python preprocess_data.py
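
preprocess_data.py is the repo's own script; the sketch below only illustrates the kind of tokenization it performs. The gpt2 tokenizer, the Alpaca field handling, and the finetuning_data/tokenized output path are assumptions:

from datasets import load_dataset
from transformers import AutoTokenizer

# Illustrative only: turn the raw Alpaca records into token ids so that
# n-gram statistics can be computed over tokens rather than raw strings.
tokenizer = AutoTokenizer.from_pretrained("gpt2")  # placeholder; the repo presumably uses the target model's tokenizer
data = load_dataset("json", data_files="finetuning_data/alpaca_data.json", split="train")

def tokenize(example):
    # Alpaca records carry "instruction", "input" and "output" fields
    text = "\n".join([example["instruction"], example["input"], example["output"]])
    return {"input_ids": tokenizer(text)["input_ids"]}

tokenized = data.map(tokenize, remove_columns=data.column_names)
tokenized.save_to_disk("finetuning_data/tokenized")  # assumed output location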

Calculate the n-gram distribution

python n_gram.py
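
Conceptually, this step counts token n-grams over the tokenized corpus and normalizes the counts into a distribution. A minimal sketch, assuming bigrams (N=2) and the output path from the preprocessing sketch above; n_gram.py fixes the real order and storage format:

from collections import Counter
from datasets import load_from_disk

N = 2  # assumed n-gram order
tokenized = load_from_disk("finetuning_data/tokenized")  # path from the preprocessing sketch

counts = Counter()
for ids in tokenized["input_ids"]:
    for i in range(len(ids) - N + 1):
        counts[tuple(ids[i:i + N])] += 1

# Normalize the counts into a distribution (presumably the basis of the soft targets)
total = sum(counts.values())
distribution = {gram: c / total for gram, c in counts.items()}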

Finetune the model

Please adjust the configuration parameters in sbatch.sh to match your Slurm setup:

sbatch sbatch.sh
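
A skeleton of what sbatch.sh might look like; the Slurm directives depend on your cluster, and the launch command (finetune.py is a hypothetical script name) stands in for whatever the repo's script actually runs:

#!/bin/bash
#SBATCH --job-name=sar-llm-finetune
#SBATCH --nodes=1
#SBATCH --gres=gpu:8
#SBATCH --time=24:00:00

# Hypothetical launch command; see the repo's sbatch.sh for the real one.
accelerate launch finetune.py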

Eval downstream tasks

bash eval_vllm.sh
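
eval_vllm.sh presumably wraps lm_eval's vLLM backend (both lm_eval and vllm are in the requirements). An illustrative invocation; the task list is a placeholder and the script defines the real one:

lm_eval --model vllm \
    --model_args pretrained=<path-to-finetuned-model>,tensor_parallel_size=1 \
    --tasks hellaswag,arc_easy \
    --batch_size auto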

Eval PPL

bash eval_ppl.sh
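
For orientation, a minimal perplexity computation with transformers; eval_ppl.sh runs the repo's own evaluation, and the model name and text here are placeholders:

import math
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; substitute the finetuned checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

ids = tokenizer("The quick brown fox jumps over the lazy dog.", return_tensors="pt")["input_ids"]
with torch.no_grad():
    loss = model(input_ids=ids, labels=ids).loss  # mean token cross-entropy
print("ppl:", math.exp(loss.item()))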

AlpacaEval

Get the output of the AlpacaEval dataset:

bash alpaca_eval_output.sh
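
An illustrative version of this step using vllm and the Hugging Face copy of the AlpacaEval set; the checkpoint path, sampling parameters, and output file name are assumptions, and alpaca_eval_output.sh is what actually runs:

import json
from datasets import load_dataset
from vllm import LLM, SamplingParams

eval_set = load_dataset("tatsu-lab/alpaca_eval", "alpaca_eval", trust_remote_code=True)["eval"]
llm = LLM(model="<path-to-finetuned-model>")  # placeholder checkpoint
params = SamplingParams(temperature=0.7, max_tokens=512)  # assumed sampling settings

# Prompt formatting (e.g. a chat template) is omitted for brevity
outputs = llm.generate([ex["instruction"] for ex in eval_set], params)
results = [
    {"instruction": ex["instruction"], "output": out.outputs[0].text, "generator": "SAR-LLM"}
    for ex, out in zip(eval_set, outputs)
]
with open("model_outputs.json", "w") as f:
    json.dump(results, f, indent=2)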

Get the win rate:

bash alpaca_eval.sh
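
alpaca_eval.sh presumably calls the alpaca_eval CLI on the generated outputs; note that the default annotator queries the OpenAI API, so OPENAI_API_KEY must be set. An illustrative call, assuming the model_outputs.json produced above:

alpaca_eval --model_outputs model_outputs.json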

For more information, please refer to AlpacaEval (https://github.com/tatsu-lab/alpaca_eval).