openai/automated-interpretability

About Direction Finding

Closed this issue · 2 comments

Dear authors, do you plan to open source the “Finding explainable directions” part of the code in the future? Thanks.

Sorry to bother, could you link to the "Finding explainable directions" part in the Repo? I would like to understand the question better. Thank you.