Similar Items: Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces