Identifying a Preliminary Circuit for Predicting Gendered Pronouns in GPT-2 Small

Publication
Preprint