Understanding Addition in Transformers

Publication
ICLR 2024