Similar Items: Towards Apples to Apples for AI Evaluations: From Real-World Use Cases to Evaluation Scenarios