Similar Items: Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces
- MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems
- StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction
- STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?
- Cited but Not Verified: Parsing and Evaluating Source Attribution in LLM Deep Research Agents
- Stable Behavior, Limited Variation: Persona Validity in LLM Agents for Urban Sentiment Perception
- Models Recall What They Violate: Constraint Adherence in Multi-Turn LLM Ideation