Similar Items: Autonomous Adversary: Red-Teaming in the age of LLM
- ContextualJailbreak: Evolutionary Red-Teaming via Simulated Conversational Priming
- Redefining AI Red Teaming in the Agentic Era: From Weeks to Hours
- Understanding Adversarial Transferability in Vision-Language Models for Autonomous Driving: A Cross-Architecture Analysis
- FlashRT: Towards Computationally and Memory Efficient Red-Teaming for Prompt Injection and Knowledge Corruption
- STARE: Step-wise Temporal Alignment and Red-teaming Engine for Multi-modal Toxicity Attack
- Latent Adversarial Detection: Adaptive Probing of LLM Activations for Multi-Turn Attack Detection