Similar Items: SkillSafetyBench: Evaluating Agent Safety under Skill-Facing Attack Surfaces
- Skill Description Deception Attack against Task Routing in Internet of Agents
- CodeClinic: Evaluating Automation of Coding Skills for Clinical Reasoning Agents
- SkillOps: Managing LLM Agent Skill Libraries as Self-Maintaining Software Ecosystems
- CalBench: Evaluating Coordination-Privacy Trade-offs in Multi-Agent LLMs
- Attacks and Mitigations for Distributed Governance of Agentic AI under Byzantine Adversaries
- Bian Que: An Agentic Framework with Flexible Skill Arrangement for Online System Operations