Channels - No More, No Less: Task Alignment in Terminal Agents :: FRELIP Discovery

Similar Items: No More, No Less: Task Alignment in Terminal Agents

Quick Look
LoopTrap: Termination Poisoning Attacks on LLM Agents
Quick Look
When Alignment Isn't Enough: Response-Path Attacks on LLM Agents
Quick Look
Safety Context Injection: Inference-Time Safety Alignment via Static Filtering and Agentic Analysis
Quick Look
STARE: Step-wise Temporal Alignment and Red-teaming Engine for Multi-modal Toxicity Attack
Quick Look
You Snooze, You Lose: Automatic Safety Alignment Restoration through Neural Weight Translation
Quick Look
Stateful Agent Backdoor
Quick Look
AgentTrust: Runtime Safety Evaluation and Interception for AI Agent Tool Use
Quick Look
Agentic Vulnerability Reasoning on Windows COM Binaries
Quick Look
Behavioral Integrity Verification for AI Agent Skills
Quick Look
Profiling for Pennies: Unveiling the Privacy Iceberg of LLM Agents
Quick Look
Engineering Robustness into Personal Agents with the AI Workflow Store
Quick Look
Five Attacks on x402 Agentic Payment Protocol
Quick Look
MOSAIC-Bench: Measuring Compositional Vulnerability Induction in Coding Agents
Quick Look
From Controlled to the Wild: Evaluation of Pentesting Agents for the Real-World
Quick Look
No Attack Required: Semantic Fuzzing for Specification Violations in Agent Skills
Quick Look
Semia: Auditing Agent Skills via Constraint-Guided Representation Synthesis
Quick Look
ARGUS: Defending LLM Agents Against Context-Aware Prompt Injection
Quick Look
Redefining AI Red Teaming in the Agentic Era: From Weeks to Hours
Quick Look
LITMUS: Benchmarking Behavioral Jailbreaks of LLM Agents in Real OS Environments
Quick Look
Proteus: A Self-Evolving Red Team for Agent Skill Ecosystems
Quick Look
SkCC: Portable and Secure Skill Compilation for Cross-Framework LLM Agents
Quick Look
MEMSAD: Gradient-Coupled Anomaly Detection for Memory Poisoning in Retrieval-Augmented Agents
Quick Look
SkillScope: Toward Fine-Grained Least-Privilege Enforcement for Agent Skills
Quick Look
CyBiasBench: Benchmarking Bias in LLM Agents for Cyber-Attack Scenarios