Similar Items: Fairness of Classifiers in the Presence of Constraints between Features
- MaD Physics: Evaluating information seeking under constraints in physical environments
- Reason to Play: Behavioral and Brain Alignment Between Frontier LRMs and Human Game Learners
- Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents
- RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses
- To Build or Not to Build? Factors that Lead to Non-Development or Abandonment of AI Systems
- Agent-Agnostic Evaluation of SQL Accuracy in Production Text-to-SQL Systems