Similar Items: The Attacker in the Mirror: Breaking Self-Consistency in Safety via Anchored Bipolicy Self-Play