Similar Items:
- ScriptHOI: Learning Scripted State Transitions for Open-Vocabulary Human-Object Interaction Detection
- FreeOcc: Training-Free Embodied Open-Vocabulary Occupancy Prediction
- OmniRobotHome: A Multi-Camera Platform for Real-Time Multiadic Human-Robot Interaction
- OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents
- LASE: Language-Adversarial Speaker Encoding for Indic Cross-Script Identity Preservation
- Object Hallucination-Free Reinforcement Unlearning for Vision-Language Models
- Temporally Consistent Object 6D Pose Estimation for Robot Control