Similar Items: Trajectory as the Teacher: Few-Step Discrete Flow Matching via Energy-Navigated Distillation
- Continuous-Time Distribution Matching for Few-Step Diffusion Distillation
- KL for a KL: On-Policy Distillation with Control Variate Baseline
- Uncertainty-Aware Structured Data Extraction from Full CMR Reports via Distilled LLMs
- StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction
- OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories
- When LLMs Stop Following Steps: A Diagnostic Study of Procedural Execution in Language Models