/rlfh-gen-div

This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity

Primary LanguagePythonOtherNOASSERTION

Watchers