/rlhf-shakespeare

A transformer trained on Shakespeare, fine-tuned with RLHF to generate positive-sentiment samples

Primary Language: Python
