Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions

Publication
EMNLP 2024