Similar Items: On the Hardness of Junking LLMs
- Verifier-Backed Hard Problem Generation for Mathematical Reasoning
- Exploration Hacking: Can LLMs Learn to Resist RL Training?
- U-Define: Designing User Workflows for Hard and Soft Constraints in LLM-Based Planning
- Early Detection of Water Stress by Plant Electrophysiology: Machine Learning for Irrigation Management
- Exponential families from a single KL identity
- TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering