Increasing Trust in Language Models Through the Reuse of Verified Circuits

Publication
arXiv:2402.02619