Similar Items: SkillOps: Managing LLM Agent Skill Libraries as Self-Maintaining Software Ecosystems
- SkillSafetyBench: Evaluating Agent Safety under Skill-Facing Attack Surfaces
- Safe Multi-Agent Behavior Must Be Maintained, Not Merely Asserted: Constraint Drift in LLM-Based Multi-Agent Systems
- CodeClinic: Evaluating Automation of Coding Skills for Clinical Reasoning Agents
- Skill Description Deception Attack against Task Routing in Internet of Agents
- Bian Que: An Agentic Framework with Flexible Skill Arrangement for Online System Operations
- Skills as Verifiable Artifacts: A Trust Schema and a Biconditional Correctness Criterion for Human-in-the-Loop Agent Runtimes