Similar Items: The First Drop of Ink: Nonlinear Impact of Misleading Information in Long-Context Reasoning
- VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Candidate Answer Selection
- Abductive Reasoning with Probabilistic Commonsense
- Rubric-Grounded RL: Structured Judge Rewards for Generalizable Reasoning
- Reason to Play: Behavioral and Brain Alignment Between Frontier LRMs and Human Game Learners
- Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents
- First-Order Efficiency for Probabilistic Value Estimation via A Statistical Viewpoint