Similar Items:
- Compress Then Adapt? No, Do It Together via Task-aware Union of Subspaces
- RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses
- What Makes a Good Terminal-Agent Benchmark Task: A Guideline for Adversarial, Difficult, and Legible Evaluation Design
- Crab: A Semantics-Aware Checkpoint/Restore Runtime for Agent Sandboxes
- SCPRM: A Schema-aware Cumulative Process Reward Model for Knowledge Graph Question Answering
- AI CFD Scientist: Toward Open-Ended Computational Fluid Dynamics Discovery with Physics-Aware AI Agents
- Empowering Heterogeneous Graph Foundation Models via Decoupled Relation Alignment