Similar Items: Synthetic Users, Real Differences: an Evaluation Framework for User Simulation in Multi-Turn Conversations