Similar Items: StateVLM: A State-Aware Vision-Language Model for Robotic Affordance Reasoning
- Task-Aware Scanning Parameter Configuration for Robotic Inspection Using Vision Language Embeddings and Hyperdimensional Computing
- Wasserstein-Aligned Localisation for VLM-Based Distributional OOD Detection in Medical Imaging
- Large Language Models are Universal Reasoners for Visual Generation
- Quantifying the human visual exposome with vision language models
- Object Hallucination-Free Reinforcement Unlearning for Vision-Language Models
- Affordance Agent Harness: Verification-Gated Skill Orchestration