Similar Items: RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses
- SCPRM: A Schema-aware Cumulative Process Reward Model for Knowledge Graph Question Answering
- Rubric-Grounded RL: Structured Judge Rewards for Generalizable Reasoning
- Crab: A Semantics-Aware Checkpoint/Restore Runtime for Agent Sandboxes
- Compress Then Adapt? No, Do It Together via Task-aware Union of Subspaces
- Born-Qualified: An Autonomous Framework for Deploying Advanced Energy and Electronic Materials
- AI CFD Scientist: Toward Open-Ended Computational Fluid Dynamics Discovery with Physics-Aware AI Agents