<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>TSG Lab – Technical Safety &amp; Governance Lab</title><link>https://tsglab.github.io/</link><atom:link href="https://tsglab.github.io/index.xml" rel="self" type="application/rss+xml"/><description>TSG Lab – Technical Safety &amp; Governance Lab</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Mon, 01 Jan 2024 00:00:00 +0000</lastBuildDate><image><url>https://tsglab.github.io/media/logo.svg</url><title>TSG Lab – Technical Safety &amp; Governance Lab</title><link>https://tsglab.github.io/</link></image><item><title>AutoControl Arena: Synthesizing Executable Test Environments for Frontier AI Risk Evaluation</title><link>https://tsglab.github.io/publication/autocontrol-arena/</link><pubDate>Sun, 01 Mar 2026 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/autocontrol-arena/</guid><description/></item><item><title>Old Habits Die Hard: How Conversational History Geometrically Traps LLMs</title><link>https://tsglab.github.io/publication/old-habits-die-hard-conversational-history/</link><pubDate>Sun, 01 Mar 2026 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/old-habits-die-hard-conversational-history/</guid><description/></item><item><title>Token Taxes: Mitigating AGI's Economic Risks</title><link>https://tsglab.github.io/publication/token-taxes-agi-economic-risks/</link><pubDate>Sun, 01 Mar 2026 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/token-taxes-agi-economic-risks/</guid><description/></item><item><title>Same Answer, Different Representations: Hidden Instability in VLMs</title><link>https://tsglab.github.io/publication/same-answer-different-representations/</link><pubDate>Sun, 01 Feb 2026 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/same-answer-different-representations/</guid><description/></item><item><title>The Hitchhiker's Guide to Actionable Interpretability</title><link>https://tsglab.github.io/publication/hitchhikers-guide-actionable-interpretability/</link><pubDate>Thu, 15 Jan 2026 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/hitchhikers-guide-actionable-interpretability/</guid><description/></item><item><title>Agentic Product Maturity Ladder V0.1</title><link>https://tsglab.github.io/publication/agentic-product-maturity-ladder/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/agentic-product-maturity-ladder/</guid><description/></item><item><title>Automated Interpretability-Driven Model Auditing and Control: A Research Agenda</title><link>https://tsglab.github.io/publication/automated-interpretability-model-auditing/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/automated-interpretability-model-auditing/</guid><description/></item><item><title>Interpretability Can Be Actionable</title><link>https://tsglab.github.io/publication/interpretability-can-be-actionable/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/interpretability-can-be-actionable/</guid><description/></item><item><title>Quantifying the Effect of Test Set Contamination on Generative Evaluations</title><link>https://tsglab.github.io/publication/quantifying-test-set-contamination/</link><pubDate>Thu, 01 Jan 2026 00:00:00 
  <item><title>The Capability Frontier: Benchmarks Miss 82% of Model Performance</title><link>https://tsglab.github.io/publication/capability-frontier-benchmarks/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/capability-frontier-benchmarks/</guid><description/></item>
  <item><title>When AI Systems Learn During Deployment, Our Safety Evaluations Break</title><link>https://tsglab.github.io/publication/safety-evaluations-break-deployment/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/safety-evaluations-break-deployment/</guid><description/></item>
  <item><title>Context Matters: Analyzing the Generalizability of Linear Probing and Steering Across Diverse Scenarios</title><link>https://tsglab.github.io/publication/context-matters-linear-probing-steering/</link><pubDate>Mon, 01 Dec 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/context-matters-linear-probing-steering/</guid><description/></item>
  <item><title>Emerging Risks from Embodied AI Require Urgent Policy Action</title><link>https://tsglab.github.io/publication/emerging-risks-embodied-ai/</link><pubDate>Mon, 01 Dec 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/emerging-risks-embodied-ai/</guid><description/></item>
  <item><title>Establishing Best Practices for Building Rigorous Agentic Benchmarks</title><link>https://tsglab.github.io/publication/agentic-benchmarks-best-practices/</link><pubDate>Mon, 01 Dec 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/agentic-benchmarks-best-practices/</guid><description/></item>
  <item><title>Full-Stack Alignment: Co-Aligning AI and Institutions with Thicker Models of Value</title><link>https://tsglab.github.io/publication/full-stack-alignment-institutions/</link><pubDate>Mon, 01 Dec 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/full-stack-alignment-institutions/</guid><description/></item>
  <item><title>Beyond Linear Steering: Unified Multi-Attribute Control for Language Models</title><link>https://tsglab.github.io/publication/beyond-linear-steering-multi-attribute-control/</link><pubDate>Sat, 01 Nov 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/beyond-linear-steering-multi-attribute-control/</guid><description/></item>
  <item><title>Precise In-Parameter Concept Erasure in Large Language Models</title><link>https://tsglab.github.io/publication/precise-concept-erasure-llms/</link><pubDate>Sat, 01 Nov 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/precise-concept-erasure-llms/</guid><description/></item>
  <item><title>Same Question, Different Words: A Latent Adversarial Framework for Prompt Robustness</title><link>https://tsglab.github.io/publication/latent-adversarial-prompt-robustness/</link><pubDate>Sat, 01 Nov 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/latent-adversarial-prompt-robustness/</guid><description/></item>
  <item><title>Trust Me, I'm Wrong: High-Certainty Hallucinations in LLMs</title><link>https://tsglab.github.io/publication/trust-me-im-wrong-hallucinations/</link><pubDate>Sat, 01 Nov 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/trust-me-im-wrong-hallucinations/</guid><description/></item>
  <item><title>Chain-of-Thought Hijacking</title><link>https://tsglab.github.io/publication/chain-of-thought-hijacking/</link><pubDate>Wed, 01 Oct 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/chain-of-thought-hijacking/</guid><description/></item>
  <item><title>HACK: Hallucinations Along Certainty and Knowledge Axes</title><link>https://tsglab.github.io/publication/hack-hallucinations-certainty-knowledge/</link><pubDate>Wed, 01 Oct 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/hack-hallucinations-certainty-knowledge/</guid><description/></item>
  <item><title>Rethinking Safety in LLM Fine-Tuning: An Optimization Perspective</title><link>https://tsglab.github.io/publication/rethinking-safety-llm-finetuning/</link><pubDate>Wed, 01 Oct 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/rethinking-safety-llm-finetuning/</guid><description/></item>
  <item><title>Val-Bench: Measuring Value Alignment in Language Models</title><link>https://tsglab.github.io/publication/val-bench-value-alignment/</link><pubDate>Wed, 01 Oct 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/val-bench-value-alignment/</guid><description/></item>
  <item><title>Beyond Linear Probes: Dynamic Safety Monitoring for Language Models</title><link>https://tsglab.github.io/publication/dynamic-safety-monitoring-linear-probes/</link><pubDate>Mon, 01 Sep 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/dynamic-safety-monitoring-linear-probes/</guid><description/></item>
  <item><title>Query Circuits: Explaining How Language Models Answer User Prompts</title><link>https://tsglab.github.io/publication/query-circuits/</link><pubDate>Mon, 01 Sep 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/query-circuits/</guid><description/></item>
  <item><title>Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer</title><link>https://tsglab.github.io/publication/subliminal-learning-hidden-biases/</link><pubDate>Mon, 01 Sep 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/subliminal-learning-hidden-biases/</guid><description/></item>
  <item><title>Do Sparse Autoencoders Generalize? A Case Study of Answerability</title><link>https://tsglab.github.io/publication/sparse-autoencoders-generalize-answerability/</link><pubDate>Tue, 01 Jul 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/sparse-autoencoders-generalize-answerability/</guid><description/></item>
  <item><title>PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning</title><link>https://tsglab.github.io/publication/poisonbench/</link><pubDate>Tue, 01 Jul 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/poisonbench/</guid><description/></item>
  <item><title>Scaling Sparse Feature Circuit Finding for In-Context Learning</title><link>https://tsglab.github.io/publication/scaling-sparse-feature-circuit-finding/</link><pubDate>Tue, 01 Jul 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/scaling-sparse-feature-circuit-finding/</guid><description/></item>
  <item><title>Beyond Monoliths: Expert Orchestration for More Capable, Democratic, and Safe Language Models</title><link>https://tsglab.github.io/publication/beyond-monoliths-expert-orchestration/</link><pubDate>Sun, 01 Jun 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/beyond-monoliths-expert-orchestration/</guid><description/></item>
  <item><title>In Which Areas of Technical AI Safety Could Geopolitical Rivals Cooperate?</title><link>https://tsglab.github.io/publication/geopolitical-rivals-ai-safety-cooperation/</link><pubDate>Sun, 01 Jun 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/geopolitical-rivals-ai-safety-cooperation/</guid><description/></item>
  <item><title>The Singapore Consensus on Global AI Safety Research Priorities</title><link>https://tsglab.github.io/publication/singapore-consensus-ai-safety/</link><pubDate>Sun, 01 Jun 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/singapore-consensus-ai-safety/</guid><description/></item>
  <item><title>SafetyNet: Detecting Harmful Outputs in LLMs by Modeling and Monitoring Deceptive Behaviors</title><link>https://tsglab.github.io/publication/safetynet-deceptive-behaviors/</link><pubDate>Thu, 01 May 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/safetynet-deceptive-behaviors/</guid><description/></item>
  <item><title>Rethinking AI Cultural Alignment</title><link>https://tsglab.github.io/publication/rethinking-ai-cultural-alignment/</link><pubDate>Tue, 01 Apr 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/rethinking-ai-cultural-alignment/</guid><description/></item>
  <item><title>Towards Interpreting Visual Information Processing in Vision-Language Models</title><link>https://tsglab.github.io/publication/visual-information-processing-vlms/</link><pubDate>Tue, 01 Apr 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/visual-information-processing-vlms/</guid><description/></item>
  <item><title>AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons</title><link>https://tsglab.github.io/publication/ailuminate-mlcommons/</link><pubDate>Sat, 01 Mar 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/ailuminate-mlcommons/</guid><description/></item>
  <item><title>Chain-of-Thought Is Not Explainability</title><link>https://tsglab.github.io/publication/chain-of-thought-not-explainability/</link><pubDate>Sat, 01 Feb 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/chain-of-thought-not-explainability/</guid><description/></item>
  <item><title>Open Problems in Machine Unlearning for AI Safety</title><link>https://tsglab.github.io/publication/open-problems-machine-unlearning/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/open-problems-machine-unlearning/</guid><description/></item>
  <item><title>Plan B: Training LLMs to Fail Less Severely</title><link>https://tsglab.github.io/publication/plan-b-llms-fail-less-severely/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/plan-b-llms-fail-less-severely/</guid><description/></item>
  <item><title>Safety Frameworks and Standards: A Comparative Analysis to Advance Risk Management of Frontier AI</title><link>https://tsglab.github.io/publication/safety-frameworks-standards/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/safety-frameworks-standards/</guid><description/></item>
  <item><title>Toward Resisting AI-Enabled Authoritarianism</title><link>https://tsglab.github.io/publication/resisting-ai-authoritarianism/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/resisting-ai-authoritarianism/</guid><description/></item>
  <item><title>Verification for International AI Governance</title><link>https://tsglab.github.io/publication/verification-international-ai-governance/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/verification-international-ai-governance/</guid><description/></item>
  <item><title>Best-of-N Jailbreaking</title><link>https://tsglab.github.io/publication/best-of-n-jailbreaking/</link><pubDate>Sun, 01 Dec 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/best-of-n-jailbreaking/</guid><description/></item>
  <item><title>Interpreting Learned Feedback Patterns in Large Language Models</title><link>https://tsglab.github.io/publication/interpreting-feedback-patterns-llms/</link><pubDate>Sun, 01 Dec 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/interpreting-feedback-patterns-llms/</guid><description/></item>
  <item><title>Jailbreak Defense in a Narrow Domain: Limitations of Existing Methods and a New Transcript-Classifier Approach</title><link>https://tsglab.github.io/publication/jailbreak-defense-narrow-domain/</link><pubDate>Sun, 01 Dec 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/jailbreak-defense-narrow-domain/</guid><description/></item>
  <item><title>Enhancing Neural Network Interpretability with Feature-Aligned Sparse Autoencoders</title><link>https://tsglab.github.io/publication/feature-aligned-sparse-autoencoders/</link><pubDate>Fri, 01 Nov 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/feature-aligned-sparse-autoencoders/</guid><description/></item>
  <item><title>Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions</title><link>https://tsglab.github.io/publication/attention-mlp-interactions/</link><pubDate>Fri, 01 Nov 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/attention-mlp-interactions/</guid><description/></item>
  <item><title>Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models</title><link>https://tsglab.github.io/publication/interpretable-sequence-continuation/</link><pubDate>Fri, 01 Nov 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/interpretable-sequence-continuation/</guid><description/></item>
  <item><title>Quantifying Feature Space Universality Across Large Language Models via Sparse Autoencoders</title><link>https://tsglab.github.io/publication/feature-space-universality-sparse-autoencoders/</link><pubDate>Tue, 01 Oct 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/feature-space-universality-sparse-autoencoders/</guid><description/></item>
  <item><title>Large Language Models Relearn Removed Concepts</title><link>https://tsglab.github.io/publication/llms-relearn-removed-concepts/</link><pubDate>Thu, 01 Aug 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/llms-relearn-removed-concepts/</guid><description/></item>
  <item><title>Mechanistic Interpretability Workshop at ICML 2024</title><link>https://tsglab.github.io/publication/mechanistic-interpretability-workshop-icml-2024/</link><pubDate>Mon, 01 Jul 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/mechanistic-interpretability-workshop-icml-2024/</guid><description/></item>
  <item><title>Position: Near to Mid-Term Risks and Opportunities of Open-Source Generative AI</title><link>https://tsglab.github.io/publication/open-source-generative-ai-risks/</link><pubDate>Mon, 01 Jul 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/open-source-generative-ai-risks/</guid><description/></item>
  <item><title>The Scaling Behavior of Large Language Models</title><link>https://tsglab.github.io/publication/scaling-behavior-llms/</link><pubDate>Mon, 01 Jul 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/scaling-behavior-llms/</guid><description/></item>
  <item><title>Visualizing Neural Network Imagination</title><link>https://tsglab.github.io/publication/visualizing-neural-network-imagination/</link><pubDate>Mon, 01 Jul 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/visualizing-neural-network-imagination/</guid><description/></item>
  <item><title>Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models</title><link>https://tsglab.github.io/publication/sycophancy-to-subterfuge/</link><pubDate>Sat, 01 Jun 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/sycophancy-to-subterfuge/</guid><description/></item>
  <item><title>Understanding Addition in Transformers</title><link>https://tsglab.github.io/publication/understanding-addition-transformers/</link><pubDate>Wed, 01 May 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/understanding-addition-transformers/</guid><description/></item>
  <item><title>Increasing Trust in Language Models Through the Reuse of Verified Circuits</title><link>https://tsglab.github.io/publication/verified-circuits-trust/</link><pubDate>Thu, 01 Feb 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/verified-circuits-trust/</guid><description/></item>
  <item><title>Contact</title><link>https://tsglab.github.io/contact/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/contact/</guid><description/></item>
  <item><title>Research</title><link>https://tsglab.github.io/research/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/research/</guid><description/></item>
  <item><title>Safeguarding AI in Finance: Lessons for Regulated Industries</title><link>https://tsglab.github.io/publication/safeguarding-ai-finance/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/safeguarding-ai-finance/</guid><description/></item>
  <item><title>Sleeper Agents: Training Deceptive LLMs That Persist Through Safety Training</title><link>https://tsglab.github.io/publication/sleeper-agents/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/sleeper-agents/</guid><description/></item>
  <item><title>Vacancies</title><link>https://tsglab.github.io/vacancies/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/vacancies/</guid><description/></item>
  <item><title>What Does GPT Store in Its MLP Weights? A Case Study of Long-Range Dependencies</title><link>https://tsglab.github.io/publication/gpt-mlp-weights-long-range/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/gpt-mlp-weights-long-range/</guid><description/></item>
  <item><title>DeepDecipher: Accessing and Investigating Neuron Activation in Large Language Models</title><link>https://tsglab.github.io/publication/deepdecipher/</link><pubDate>Fri, 01 Dec 2023 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/deepdecipher/</guid><description/></item>
  <item><title>Measuring Value Alignment</title><link>https://tsglab.github.io/publication/measuring-value-alignment/</link><pubDate>Fri, 01 Dec 2023 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/measuring-value-alignment/</guid><description/></item>
  <item><title>AI Systems of Concern</title><link>https://tsglab.github.io/publication/ai-systems-of-concern/</link><pubDate>Sun, 01 Oct 2023 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/ai-systems-of-concern/</guid><description/></item>
  <item><title>Detecting Edit Failures in Large Language Models: An Improved Specificity Benchmark</title><link>https://tsglab.github.io/publication/detecting-edit-failures-llms/</link><pubDate>Sat, 01 Jul 2023 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/detecting-edit-failures-llms/</guid><description/></item>
  <item><title>The Larger They Are, the Harder They Fail: Language Models Do Not Recognize Identifier Swaps in Python</title><link>https://tsglab.github.io/publication/identifier-swaps-python/</link><pubDate>Sat, 01 Jul 2023 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/identifier-swaps-python/</guid><description/></item>
  <item><title>Neuron to Graph: Interpreting Language Model Neurons at Scale</title><link>https://tsglab.github.io/publication/neuron-to-graph/</link><pubDate>Mon, 01 May 2023 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/neuron-to-graph/</guid><description/></item>
  <item><title>Fairness in AI and Its Long-Term Implications on Society</title><link>https://tsglab.github.io/publication/fairness-ai-long-term-implications/</link><pubDate>Sun, 01 Jan 2023 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/fairness-ai-long-term-implications/</guid><description/></item>
  <item><title>Identifying a Preliminary Circuit for Predicting Gendered Pronouns in GPT-2 Small</title><link>https://tsglab.github.io/publication/circuit-gendered-pronouns-gpt2/</link><pubDate>Sun, 01 Jan 2023 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/circuit-gendered-pronouns-gpt2/</guid><description/></item>
  <item><title>The Alan Turing Institute's Response to the House of Lords Large Language Models Call for Evidence</title><link>https://tsglab.github.io/publication/turing-institute-lords-llm-evidence/</link><pubDate>Sun, 01 Jan 2023 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/turing-institute-lords-llm-evidence/</guid><description/></item>
  <item><title>System III: Learning with Domain Knowledge for Safety Constraints</title><link>https://tsglab.github.io/publication/system-iii-safety-constraints/</link><pubDate>Thu, 01 Dec 2022 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/publication/system-iii-safety-constraints/</guid><description/></item>
  <item><title>People</title><link>https://tsglab.github.io/people/</link><pubDate>Mon, 24 Oct 2022 00:00:00 +0000</pubDate><guid>https://tsglab.github.io/people/</guid><description/></item>
</channel>
</rss>