openai/automated-interpretability

Is there a demo that shows this great project?

guotong1988 opened this issue · 1 comment

Thank you very much!

[disclaimer: i am the creator of neuronpedia]

check out neuronpedia.org. it uses automated-interpretability for scoring gpt2-small neuron explanations - and it lets anyone contribute their own explanations too.
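
for context, here is a rough sketch of what "scoring" an explanation means in this setting: the explanation is used to simulate a neuron's activations, and the simulated values are compared with the real ones by correlation. this is only an illustration under that assumption. `correlation_score` and the sample numbers are hypothetical, not the automated-interpretability API and not neuronpedia's code.

```python
import math


def correlation_score(real, simulated):
    """Pearson correlation between real and simulated activations.

    Hypothetical helper for illustration only; not part of the
    automated-interpretability package.
    """
    if len(real) != len(simulated) or len(real) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    n = len(real)
    mean_real = sum(real) / n
    mean_sim = sum(simulated) / n
    cov = sum((r - mean_real) * (s - mean_sim) for r, s in zip(real, simulated))
    var_real = sum((r - mean_real) ** 2 for r in real)
    var_sim = sum((s - mean_sim) ** 2 for s in simulated)
    if var_real == 0 or var_sim == 0:
        # A constant sequence carries no signal; treat the score as 0 here.
        return 0.0
    return cov / math.sqrt(var_real * var_sim)


# Made-up per-token activations for one neuron.
real_activations = [0.0, 0.1, 2.3, 0.0, 1.8]
simulated_activations = [0.0, 0.0, 2.0, 0.5, 1.5]
print(correlation_score(real_activations, simulated_activations))
```

a score near 1 means the explanation predicts the neuron's behavior well, and a score near 0 means it predicts almost nothing.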