Neuron to Graph: Interpreting Language Model Neurons at Scale

Publication
ICLR 2023 Workshop