Similar Items: Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation
- D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models
- MARBLE: Multi-Aspect Reward Balance for Diffusion RL
- Flow-OPD: On-Policy Distillation for Flow Matching Models
- CMTA: Leveraging Cross-Modal Temporal Artifacts for Generalizable AI-Generated Video Detection
- FreeSpec: Training-Free Long Video Generation via Singular-Spectrum Reconstruction
- Unpaired Image Deraining Using Reward-Guided Self-Reinforcement Strategy