Similar Items: LoopTrap: Termination Poisoning Attacks on LLM Agents
- When Alignment Isn't Enough: Response-Path Attacks on LLM Agents
- CyBiasBench: Benchmarking Bias in LLM Agents for Cyber-Attack Scenarios
- MEMSAD: Gradient-Coupled Anomaly Detection for Memory Poisoning in Retrieval-Augmented Agents
- Latent Adversarial Detection: Adaptive Probing of LLM Activations for Multi-Turn Attack Detection
- Exposing LLM Safety Gaps Through Mathematical Encoding:New Attacks and Systematic Analysis
- Fight Poison with Poison: Enhancing Robustness in Few-shot Machine-Generated Text Detection with Adversarial Training