Text this: Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces