Similar Items: Towards Apples to Apples for AI Evaluations: From Real-World Use Cases to Evaluation Scenarios
- "It depends on where AI is used": Players' attitude patterns and evaluative logics toward different AI applications in digital games
- Principles and Guidelines for Randomized Controlled Trials in AI Evaluation
- UX in the Age of AI: Rethinking Evaluation Metrics Through a Statistical Lens
- AI and Consciousness: Shifting Focus Towards Tractable Questions
- The Missing Evaluation Axis: What 10,000 Student Submissions Reveal About AI Tutor Effectiveness
- Cripping AI: Reimagining AI Through Lived Disability Experiences