Similar Items: CalBench: Evaluating Coordination-Privacy Trade-offs in Multi-Agent LLMs
- Coordination Matters: Evaluation of Cooperative Multi-Agent Reinforcement Learning
- Coordination as an Architectural Layer for LLM-Based Multi-Agent Systems
- QKVShare: Quantized KV-Cache Handoff for Multi-Agent On-Device LLMs
- SWE-WebDevBench: Evaluating Coding Agent Application Platforms as Virtual Software Agencies
- TraceFix: Repairing Agent Coordination Protocols with TLA+ Counterexamples
- SOTOPIA-TOM: Evaluating Information Management in Multi-Agent Interaction with Theory of Mind