Similar Items: Abductive Reasoning with Probabilistic Commonsense
- First-Order Efficiency for Probabilistic Value Estimation via A Statistical Viewpoint
- Rubric-Grounded RL: Structured Judge Rewards for Generalizable Reasoning
- Reason to Play: Behavioral and Brain Alignment Between Frontier LRMs and Human Game Learners
- VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Candidate Answer Selection
- Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents
- RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses