Similar Items:
- CapVector: Learning Transferable Capability Vectors in Parametric Space for Vision-Language-Action Models
- Seeing Realism from Simulation: Efficient Video Transfer for Vision-Language-Action Data Augmentation
- ALAM: Algebraically Consistent Latent Transitions for Vision-Language-Action Models
- Quantifying the human visual exposome with vision language models
- Object Hallucination-Free Reinforcement Unlearning for Vision-Language Models
- MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons
- StateVLM: A State-Aware Vision-Language Model for Robotic Affordance Reasoning