S. B. Cohen | TSG Lab – Technical Safety & Governance Lab

https://tsglab.github.io/author/s.-b.-cohen/

Old Habits Die Hard: How Conversational History Geometrically Traps LLMs
Sun, 01 Mar 2026
https://tsglab.github.io/publication/old-habits-die-hard-conversational-history/

PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning
Tue, 01 Jul 2025
https://tsglab.github.io/publication/poisonbench/

Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions
Fri, 01 Nov 2024
https://tsglab.github.io/publication/attention-mlp-interactions/

Large Language Models Relearn Removed Concepts
Thu, 01 Aug 2024
https://tsglab.github.io/publication/llms-relearn-removed-concepts/

The Scaling Behavior of Large Language Models
Mon, 01 Jul 2024
https://tsglab.github.io/publication/scaling-behavior-llms/

What Does GPT Store in Its MLP Weights? A Case Study of Long-Range Dependencies
Mon, 01 Jan 2024
https://tsglab.github.io/publication/gpt-mlp-weights-long-range/

The Larger They Are, the Harder They Fail: Language Models Do Not Recognize Identifier Swaps in Python
Sat, 01 Jul 2023
https://tsglab.github.io/publication/identifier-swaps-python/