Similar Items: Characterizing the Consistency of the Emergent Misalignment Persona
- VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Candidate Answer Selection
- Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents
- RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses
- To Build or Not to Build? Factors that Lead to Non-Development or Abandonment of AI Systems
- Agent-Agnostic Evaluation of SQL Accuracy in Production Text-to-SQL Systems
- What Makes a Good Terminal-Agent Benchmark Task: A Guideline for Adversarial, Difficult, and Legible Evaluation Design