Similar Items: LaST-R1: Reinforcing Action via Adaptive Physical Latent Reasoning for VLA Models
- One Token Per Frame: Reconsidering Visual Bandwidth in World Models for VLA Policy
- PhysEdit: Physically-Consistent Region-Aware Image Editing via Adaptive Spatio-Temporal Reasoning
- Continuous Latent Diffusion Language Model
- UnAC: Adaptive Visual Prompting with Abstraction and Stepwise Checking for Complex Multimodal Reasoning
- InpaintSLat: Inpainting Structured 3D Latents via Initial Noise Optimization
- Reconstruction or Semantics? What Makes a Latent Space Useful for Robotic World Models