Similar Items: CodeClinic: Evaluating Automation of Coding Skills for Clinical Reasoning Agents
- SWE-WebDevBench: Evaluating Coding Agent Application Platforms as Virtual Software Agencies
- Retrieval-Conditioned Topology Selection with Provable Budget Conservation for Multi-Agent Code Generation
- Emergent Communication for Co-constructed Emotion Between Embodied Agents via Collective Predictive Coding
- The Bystander Effect in Multi-Agent Reasoning: Quantifying Cognitive Loafing in Collaborative Interactions
- I Would If I Could: Reasoning about Dynamics of Actions in Multi-Agent Systems
- Skill Description Deception Attack against Task Routing in Internet of Agents