EleutherAI/concept-erasure

Applying this at decoding time


Hi, thanks for the repository and the paper. Is it possible to apply this to generation tasks in language models, and not just to classification?
I am very interested in this use case. Also, just to confirm: the scrubber is applied during inference and does not modify the model parameters, right? It only modifies the hidden representations?
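
To make the question concrete, here is a rough sketch of what I have in mind. It does not use this library's API: `TinyLM`, `direction`, and `erase_hook` are hypothetical names, the "concept direction" is just a random unit vector rather than a fitted eraser, and the edit is a plain orthogonal projection. It only illustrates editing hidden states through a PyTorch forward hook during greedy decoding while leaving the weights untouched.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)


class TinyLM(nn.Module):
    """Toy stand-in for a language model (hypothetical, illustration only)."""

    def __init__(self, vocab=16, dim=8):
        super().__init__()
        self.embed = nn.Embedding(vocab, dim)
        self.block = nn.Linear(dim, dim)
        self.head = nn.Linear(dim, vocab)

    def forward(self, ids):
        return self.head(torch.tanh(self.block(self.embed(ids))))


model = TinyLM().eval()

# Assumption: a random unit vector stands in for a concept direction.
# A real eraser would be fitted on labelled hidden states instead.
direction = torch.randn(8)
direction = direction / direction.norm()

captured = []  # edited hidden states, kept only for inspection


def erase_hook(module, inputs, output):
    # Subtract the component of each hidden state along `direction`.
    # Returning a tensor from a forward hook replaces the module's output.
    edited = output - (output @ direction).unsqueeze(-1) * direction
    captured.append(edited)
    return edited


params_before = [p.clone() for p in model.parameters()]
handle = model.block.register_forward_hook(erase_hook)

# Greedy decoding: the hook fires on every forward pass, so the edit is
# applied at each generation step.
ids = torch.tensor([[1]])
with torch.no_grad():
    for _ in range(5):
        logits = model(ids)
        next_id = logits[:, -1].argmax(dim=-1, keepdim=True)
        ids = torch.cat([ids, next_id], dim=1)

handle.remove()  # removing the hook restores the original behaviour
```

If the scrubber works along these lines, I would expect the parameters to be identical before and after generation, with only the intermediate activations changed while the hook is attached.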