Similar Items: A decoupled diffusion planner that adapts to changing cost limits by using cost-conditioned generation for safety and reward gradients for performance
- Conditional Diffusion Sampling
- PET-Adapter: Test-Time Domain Adaptation for Full and Limited-Angle PET Image Reconstruction
- Low-Cost Black-Box Detection of LLM Hallucinations via Dynamical System Prediction
- Themis: Training Robust Multilingual Code Reward Models for Flexible Multi-Criteria Scoring
- FlexiTac: A Low-Cost, Open-Source, Scalable Tactile Sensing Solution for Robotic Systems
- Rollout Pass-Rate Control: Steering Binary-Reward RL Toward Its Most Informative Regime