facebookresearch/rlfh-gen-div
This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity
PythonNOASSERTION
This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity
PythonNOASSERTION