Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models

Publication
EMNLP 2024