Similar Items: The Attacker in the Mirror: Breaking Self-Consistency in Safety via Anchored Bipolicy Self-Play
- Fast Rates in $α$-Potential Games via Regularized Mirror Descent
- Your Loss is My Gain: Low Stake Attacks on Liquid Staking Pools
- What Suppresses Nash Equilibrium Play in Large Language Models? Mechanistic Evidence and Causal Control
- Why Do LLMs Struggle in Strategic Play? Broken Links Between Observations, Beliefs, and Actions
- Unsecured Lending via Delegated Underwriting
- In-Context Credit Assignment via the Core